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Preface 


The aim of this book is to introduce the reader to the fascinating world of 
convex polytopes. 

The highlights of the book are three main theorems in the combinatorial 
theory of convex polytopes, known as the Dehn-Sommerville Relations, the 
Upper Bound Theorem and the Lower Bound Theorem. All the background 
information on convex sets and convex polytopes which is needed to under- 
stand and appreciate these three theorems is developed in detail. This 
background material also forms a basis for studying other aspects of polytope 
theory. 

The Dehn-Sommerville Relations are classical, whereas the proofs of 
the Upper Bound Theorem and the Lower Bound Theorem are of more 
recent date: they were found in the early 1970’s by P. McMullen and D. 
Barnette, respectively. A famous conjecture of P. McMullen on the charac- 
terization of f-vectors of simplicial or simple polytopes dates from the same 
period; the book ends with a brief discussion of this conjecture and some of 
its relations to the Dehn—Sommerville Relations, the Upper Bound Theorem 
and the Lower Bound Theorem. However, the recent proofs that McMullen’s 
conditions are both sufficient (L. J. Billera and C. W. Lee, 1980) and necessary 
(R. P. Stanley, 1980) go beyond the scope of the book. 

Prerequisites for reading the book are modest: standard linear algebra and 
elementary point set topology in R¢ will suffice. 

The author is grateful to the many people who have contributed to the 
book: several colleagues, in particular Victor Klee and Erik Sparre Andersen, 
supplied valuable information; Aage Bondesen suggested essential improve- 
ments; students at the University of Copenhagen also suggested improve- 
ments; and Ulla Jacobsen performed an excellent typing job. 


Copenhagen ARNE BR@NDSTED 
February 1982 
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Introduction 


Convex polytopes are the d-dimensional analogues of 2-dimensional convex 
polygons and 3-dimensional convex polyhedra. The theme of this book is 
the combinatorial theory of convex polytopes. Generally speaking, the com- 
binatorial theory deals with the numbers of faces of various dimensions 
(vertices, edges, etc.). An example is the famous theorem of Euler, which states 
that for a 3-dimensional convex polytope, the number f, of vertices, the 
number /, of edges and the number f, of facets are connected by the relation 


fo-—fi + fa = 2. 


(In contrast to the combinatorial theory, there is a metric theory, dealing 
with such notions as length, angles and volume. For example, the concept 
of a regular polytope belongs to the metric theory.) 

The main text is divided into three chapters, followed by three appendices. 
The appendices supply the necessary background information on lattices, 
graphs and combinatorial identities. Following the appendices, and preceding 
the bibliography, there is a section with bibliographical comments. Each of 
Sections 1-15 ends with a selection of exercises. 

Chapter 1 (Sections 1-6), entitled “Convex Sets,” contains those parts of 
the general theory of d-dimensional convex sets that are needed in what 
follows. Among the basic notions are the convex hull, the relative interior 
of a convex set, supporting hyperplanes, faces of closed convex sets and 
polarity. (Among the basic notions of convexity theory not touched upon 
we mention convex cones and convex functions.) 

The heading of Chapter 2 (Sections 7-15) is “Convex Polytopes.” In 
Sections 7-11 we apply the general theory of convex sets developed in 
Chapter 1 to the particular case of convex polytopes. (It is the author’s 
belief that many properties of convex polytopes are only appreciated 
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when seen on the background of properties of convex sets in general.) In 
Sections 12—14 the important classes of simple, simplicial, cyclic and neigh- 
bourly polytopes are introduced. In Section 15 we study the graph determined 
by the vertices and edges of a polytope. 

Chapter 3 contains selected topics in the “Combinatorial Theory of 
Convex Polytopes.” We begin, in Section 16, with Euler’s Relation in its 
d-dimensional version. In Section17 we discuss the so-called Dehn- 
Sommerville Relations which are “Euler-type” relations, valid for simple 
or simplicial polytopes only. Sections 18 and 19 are devoted to the celebrated 
Upper Bound Theorem and Lower Bound Theorem, respectively; these 
theorems solve important extremum problems involving the numbers of 
faces (of various dimensions) of simple or simplicial polytopes. Finally, 
in Section 20 we report on a recent fundamental theorem which gives 
“complete information” on the numbers of faces (of various dimensions) 
of a simple or simplicial polytope. 


The following flow chart outlines the organization of the book. However, 
there are short cuts to the three main theorems of Chapter 3. To read the 
proof of the Dehn—Sommerville Relations (Theorem 17.1) only Sections 
1-12 and Euler’s Relation (Theorem 16.1) are needed; Euler’s Relation 
also requires Theorem 15.1. To read the proof of the Upper Bound Theorem 
(Theorem 18.1) only Sections 1-14 and Theorems 15.1-15.3 are needed. 
To read the Lower Bound Theorem (Theorem 19.1) only Sections 1-12 
and 15, and hence also Appendix 2, are needed. It is worth emphasizing that 
none of the three short cuts requires the somewhat technical Appendix 3. 
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Chapter 2 


CHAPTER 1 
Convex Sets 


$1. The Affine Structure of R* 


The theory of convex polytopes, and more generally the theory of convex 
sets, belongs to the subject of affine geometry. In a sense, the right framework 
for studying convex sets is the notion of a Euclidean space, i.e. a finite- 
dimensional real affine space whose underlying linear space is equipped 
with an inner product. However, there is no essential loss of generality in 
working only with the more concrete spaces R*; therefore, everything will 
take place in R*. We will assume that the reader is familiar with the standard 
linear theory of R‘, including such notions as subspaces, linear independence, 
dimension, and linear mappings. We also assume familiarity with the stan- 
dard inner product <-, -> of R%, including the induced norm ||-||, and elemen- 
tary topological notions such as the interior int M, the closure cl M, and 
the boundary bd M of a subset M of R’. 

The main purpose of this section is to give a brief survey of the affine 
structure of R“. We give no proofs here; the reader is invited to produce 
his own proofs, essentially by reducing the statements in the affine theory to 
statements in the linear theory. It is important that the reader feels at home 
in the affine structure of R’. 


For de N, we denote by Ré the set of all d-tuples x = (a,..., a4) of 
real numbers a,,..., %. We identify R' with R, and we define R° := {0}. 

We recall some basic facts about the linear structure of R*. Equipped 
with the standard linear operations, R* is a linear space. When the linear 
structure of R? is in the foreground, the elements of R® are called vectors. 
The zero vector is denoted by o. 
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A linear subspace is a non-empty subset L of R* such that 
(a) A,x, + A,X, isin L for all x,,x,¢€Land all 4,,4,€R. 


A linear combination of vectors x,,..., X, from R® is a vector of the form 
AiX, t+: +A, yX,, Where 4,,...,4, are in R. Corresponding to n = 0, 
we allow the empty linear combination with the value o. (In the definition 
of a linear combination there is a certain ambiguity. In some situations 
when talking about a linear combination 4,x, +---+4,x, we not only 
think of the vector x = A,x, +---+4,x,, but also of the particular co- 
efficients A,,..., A, used to represent x.) The condition (a) expresses that 
any linear combination of two vectors from L is again in L. Actually, (a) 
is equivalent to the following: 


(b) Any linear combination of vectors from L is again in L. 


(Strictly speaking, (a) and (b) are only equivalent when L 4 @. For L = ©, 
condition (a) holds, whereas (b) is violated by the fact that we allow the 
empty linear combination. Note, however, that we did require L # @ in 
the definition of a linear subspace.) 

The intersection of any family of linear subspaces of R® is again a linear 
subspace of R*. Therefore, for any subset M of R* there is a smallest linear 
subspace containing M, namely, the intersection of all linear subspaces 
containing M. This subspace is called the linear subspace spanned by M, 
or the linear hull of M, and is denoted by span M. 

One has the following description of the linear hull of a subset M: 


(c) For any subset M of R¥‘, the linear hull span M is the set of all linear 
combinations of vectors from M. 


(Note that our convention concerning the empty linear combination 
ensures that the correct statement span @ = {o} is included in (c).) 

An n-family (x,,..., X,) of vectors from R‘ is said to be linearly independent 
if a linear combination 4,x, + --: + A,x, can only have the value o when 
A, =-:: =A, = 0. (Note that the empty family, corresponding to n = Q, 
is linearly independent.) Linear independence is equivalent to saying that 
none of the vectors is a linear combination of the remaining ones. When a 


vector x is a linear combination of x,,...,X,, Say X =A,;X; +-°°-+4,%,, 
then the coefficients 4,,...,A4, are uniquely determined if and only if 
(x;,---,X,) 18 linearly independent. An n-family (x,,...,x,) which is not 


linearly independent is said to be linearly dependent. 

A linear basis of a linear subspace L of R® is a linearly independent n- 
family (x,,...,X,) of vectors from L such that L = span{x,,..., x,}..The 
dimension dim L of L 1s the largest non-negative integer n such that some 
n-family of vectors from L is linearly independent. A linearly independent 
n-family of vectors from L is a basis of L if and only ifn = dim L. 

Let M be any subset of R*%, and let n be the dimension of span M. Then 
there is actually a linearly independent n-family (x,,...,x,) of vectors 
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from M, i.e. there 1s a basis (x,,..., X,) of span M consisting of vectors from 
M. We therefore have: 


(d) For any subset M of R*, there exists a linearly independent family 
(xX1,.-.,X,) of vectors from M such that span M is the set of all linear 
combinations 


Of X15 +055 Xp- 
This statement is a sharpening of (c). It shows that to generate span M we 
need only take all linear combinations of the fixed vectors x,,..., x, from 


M. Furthermore, each vector in span M has a unique representation as a 
linear combination of x,,..., X,.- 

A mapping @ from some linear subspace L of R* into R° is called a linear 
mapping if it preserves linear combinations, 1.e. 


of » ix = » A; p(X;). 
i=1 i=1 

When @ is linear, then ¢g(L) is a linear subspace of R®. Linear mappings are 
continuous. 

A one-to-one linear mapping from a linear subspace L, of R* onto a 
linear subspace L, of R° is called a (linear) isomorphism. If there exists an 
isomorphism from L, onto L,, then L, and L, are said to be isomorphic. 
Two linear subspaces are isomorphic if and only if they have the same 
dimension. An isomorphism is also a homeomorphism, Le. it preserves the 
topological structure. 


We next move on to a discussion of the affine structure of R*. An affine 
subspace of R® is either the empty set @ or a translate of a linear subspace, 
i.e. a subset A = x + L where x € R’ and L is a linear subspace of R*. (Note 
that L is unique whereas x can be chosen arbitrarily in A.) By an affine 
space we mean an affine subspace of some R*. When A, and A, are affine 
subspaces of R* with A, < A3, we shall also call A, an affine subspace of 
A,. The elements x = (a,,..., 44) of some affine subspace A of R* will be 
called points when the affine structure, rather than the linear structure, is in 
the foreground. (However, it will not always be possible, nor desirable, 
to distinguish between points and vectors.) 

A subset A of R‘ is an affine subspace if and only if the following holds: 


(a’) A,X, + A,X, isin A for all x,,x,¢€AandalldA,,A,€RwithA, + A, = 1. 
For any two distinct points x, and x, in R%, the set 


{A,X +. ApxX2|A1,42E RA, + A, = 1 
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is called the line through x, and x,. The condition (a’) then states that the 
line through any two points of A is contained in A. 

An affine combination of points x;,..., x, from R‘ is a linear combination 
AyXy +++ + AqgX,, where A, +--- +A, = 1. We shall write 


n 
3 A;X; 
i=1 


to indicate that the linear combination 4,x, +---+4,x, 1S in fact an 
affine combination. (The empty linear combination is not an affine combina- 
tion. Therefore, in an affine combination 4,x, + --- + 4,x, we always have 
n > 1.) The condition (a’) states that any affine combination of two points 
from A is again in A. Actually, (a’) is equivalent to the following: 


(b’) Any affine combination of points from A is again in A. 


The intersection of any family of affine subspaces of R* is again an affine 
subspace of R*. (Here it is important to note that @ is an affine subspace.) 
Therefore, for any subset M of R‘ there is a smallest affine subspace containing 
M, namely, the intersection of all affine subspaces containing M. This 
affine subspace is called the affine subspace spanned by M, or the affine hull 
of M, and it is denoted by aff M. 

One has the following description of the affine hull of a subset M: 


(c’) For any subset M of R‘, the affine hull aff M is the set of all affine 
combinations of points from M. 


An n-family (x,, ..., x,,) of points from R?¢ is said to be affinely independent 
if a linear combination A,x, +--- + A,x, with A; + ---+ A, = 0 can only 
have the value o when A, = --- = A, = 0. (In particular, the empty family, 


corresponding to n = Q, is affinely independent.) Affine independence is 
equivalent to saying that none of the points is an affine combination of the 
remaining points. When a point x is an affine combination of x,,..., X,, 
say x =A,xX, +-::+4,X,, then the coefficients 1,,...,4, are uniquely 
determined if and only if (x,,..., x,) is affinely independent. An n-family 
(X1,..., X,) which is not affinely independent is said to be affinely dependent. 

Affine independence of an n-family (x,,...,X,) 18S equivalent to linear 
independence of one/all of the (n — 1)-families 


(x, — Xing eee, Xj — Xin X44 — Xj,.++5X_n — Xj); i= l,...,7. 


An affine basis of an affine space A is an affinely independent n-family 
(x,,...,X,) of points from A such that A = aff{x,,...,x,}. The dimension 
dim A of anon-empty affine space A is the dimension of the linear subspace L 
such that A = x + L. (Since L is unique, dim A is well defined. When A is a 
linear subspace, then the affine dimension and the linear dimension are the 
same by definition, and therefore we may use the same notation.) When 
A = ©, we put dim A = —1. The dimension of A is then n — 1 if and only 
ifn is the largest non-negative integer such that there is an affinely independent 
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n-family of points from A. An affinely independent n-family of points from 
A is an affine basis of A if and only ifn = dim A + 1. 

Let M be any subset of R%, and let the dimension of aff M be n — 1. 
Then there is actually an affinely independent n-family (x,,...,x,) of 
points from M, 1.e. there is an affine basis (x,,..., x,,) of aff M consisting of 
points from M. We therefore have: 


(d’) For any subset M of R*, there exists an affinely independent family 
(X;,-..,X,) Of points from M such that aff M is the set of all affine 


combinations 
AX: 
i=1 
Of Xy, 2665 Xpqe 
This statement is a sharpening of (c’). It shows that to generate aff M it 
suffices to take all affine combinations of the fixed points x,,..., x, from M. 


Furthermore, each point in aff M has a unique representation as an affine 
combination of x,,..., X,.- 

The 0-dimensional affine spaces are the 1-point sets. The 1-dimensional 
affine spaces are called lines. When x, and x, are two distinct points of R’%, 
then the 2-family (x,,x,) is affinely independent. Therefore, aff{x,, x,} 
is 1-dimensional, 1.e. a line, and it is in fact the line through x, and x, in 
the sense used earlier in this section. Conversely, the line through two points 
x, and x, in the earlier sense is in fact a 1-dimensional affine space, 1... a 
line. 

An (n — 1)-dimensional affine subspace of an n-dimensional affine 
space A, where n > 1, 1s called a hyperplane in A. If A 1s an affine subspace 
of R*, then the hyperplanes in A are the sets H ~ A where H is a hyperplane 
in R* such that H - A is a non-empty proper subset of A. 

A mapping 9 from an affine subspace A of R‘ into R¢ is called an affine 
mapping if it preserves affine combinations, i.e. 


of ya ix = 4 A; P(X;). 
i=1 i=1 


When @ 1s affine, then (A) is an affine subspace of R°. When A = x + L, 
where L is a linear subspace of R%, then a mapping g: A > R¢° is affine if and 
only if there exists a linear mapping ®: L > R° and a point ye R® such that 
g(x + z) = y + D(z) for all ze L. Affine mappings are continuous. 

An affine mapping g: A — R is called an affine function on A. For each 
hyperplane H in A there is a (non-constant) affine function g on A such that 
H = g +(0). Conversely, p~ ‘(0) is a hyperplane in A for each non-constant 
affine function @ on A. We have gy '(0) = w~ '(0) for two affine functions 
o and w on A if and only if @ = Aw for some non-zero real A. 

When ¢ is a non-constant affine function on an affine space A, we shall 
call the sets p~ '(]— 0, O[) and »~ 1(J0, + cof) the open halfspaces bounded 
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by the hyperplane H = gy '(0), and we shall call the sets po '(J—0,0]) 
and »~+([0, +00[) the closed halfspaces bounded by H = gy ‘(0). Open 
halfspaces are non-empty open sets, closed halfspaces are non-empty closed 
sets. If H = m ‘(0) is a hyperplane in A, then two points from A \ H are 
said to be on the same side of H if they both belong to g ‘((]- ©, OD or both 
belong to g '(J0, + co[); if each of the two open halfspaces contains one 
of the two points, we shall say that they are on opposite sides of H. 

A halfline is a halfspace in a line. 

Let A be an affine subspace of R*, and let K be a closed halfspace in R* 
such that A q K is a non-empty proper subset of A. Then A nm K is a closed 
halfspace in A. Conversely, each closed halfspace in A arises this way. 
The same applies to open halfspaces. 

For y€ R’ and a€R we let 


H(y, a) = {x € R*| <x, y> = a}. 


Note that H(o,«) = R¢? when « = 0, and H(o,a) = @ when a #0. The 
fact that the affine functions on R‘ are precisely the functions 


xreixy>-a yeRiaeR, 


implies that the hyperplanes in R® are precisely the sets H(y, «) for y # 0. 
If y 4 o, then y is called a normal of H(y, «). 
For ye R’ and a€R we let 


K(y, a) = {x € R*|<x, y> < a}. 


Note that K(o,«) = R* when a> 0, and K(o,a) = @ when « < 0. For 
y # 0, the set K(y, a) is one of the two closed halfspaces in R* bounded by 
H(y,«). The other closed halfspace bounded by H(y,a) is K(—y, —«). 
Note that 


bd K(y, «) = H(y, @), 


int K(y, «) = K(y, «) \ HQ, o), 
clint K(y, «)) = K(y, «), 


when y # o. 

A one-to-one affine mapping from an affine space A, onto an affine 
space A, is called an (affine) isomorphism. If there exists an isomorphism 
from A, onto A,, then A, and A, are said to be (affinely) isomorphic. Two 
affine spaces are isomorphic if and only if they have the same dimension. 
An isomorphism is also a homeomorphism, i.e. it preserves the topological 
structure. 

From what has been said above, it follows that any d-dimensional affine 
space A is affinely isomorphic to the particular d-dimensional affine space 
R*. In other words, A may be “identified” with R*%, not only in an affine 
sense but also in a topological sense. Note also that any given point of A 
can be “identified” with any given point of R’*. 
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Finally, we should like to point out that this section does not include all 


the necessary information about the linear and affine structure of R? needed 
in what follows. Some important additional information is contained in 
Exercises 1.1—1.5. 


EXERCISES 
1.1. Let (x,,...,x,) be an n-family of points from R*, where 
X; = (04;,---5 Lai) i=l,...,n. 
Let 
X= (1, 04;,..-, Xi), i= 1,...,n. 


1.2. 


1.3. 


1.4. 


1.5. 


1.6. 


1.7, 


Show that the n-family (x,,..., x,) is affinely independent if and only if the n-family 
(X,,..., X,) of vectors from R**! is linearly independent. 


For any subset M of R*, show that 


dim(aff M) = dim(span M) 
when o € aff M, and 


dim(aff M) = dim(span M) — 1 
when o ¢ aff M. 


Let A be an affine subspace of R%, and let H be a hyperplane in R“*. Show that 
dim(A 4 H) =dimA — 1 
when AN H 4 @andA ¢£ H. 


Let A, = x, + L, and A, = x, + L, be non-empty affine subspaces of R*. Then 
A, and A, are said to be parallel if L,; < L, or L, < L,, complementary if L, and 
L, are complementary (and orthogonal if L, and L, are orthogonal). 

Show that if A, and A, are parallel and A, 01 A, # @, then A; CA, or 
A, < A,. 

Show that if A, and A, are complementary, then A, 4 A, is a 1-point set. 


Let A, = x, + L, and A, = x, + L,becomplementary affine subspaces of R%, and 
let Xg be the unique common point of A, and A,, cf. Exercise 1.4. Then A, = 
Xo + L, and A, = Xo + L,. Let II: R4 > L, denote the projection in the direction 
of L,. For any x € R%, let x(x) := x9 + T(x — x). Show that 2(x) is the unique 
common point of A, and (x — x.) + A,. (The mapping z is called the projection 
onto A, in the direction of A,. When A, and A, are also orthogonal, then z is called 
the orthogonal projection onto A,.) 


An n-family (x,,..., X,) of points from R® is said to be in general position if every 
subfamily (x;,,...,%;,) with p<d+1 is affinely independent. Verify that 
(x,,..-., X,) 1S in general position if and only if for each k withO < k < d — 1 and 


for each k-dimensional affine subspace A of R%, the number of i’s such that x, € A 
is at most k + 1. 


Let x,,..., x, be distinct points in R*. Show that there is w 4 o such that for each 
a € R, the hyperplane H(w, «) contains at most one of the points x,,..., X,. 
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§2. Convex Sets 


In this section we shall introduce the notion of a convex set and we shall 
prove some basic facts about such sets. In Section 1 we demonstrated a 
strong analogy between linear concepts and affine concepts. This analogy 
carries over to convex concepts, though not in a complete fashion. 


A subset C of R¢ is called a convex set if A,x, + A,X belongs to C for all 
X1,X,¢€C and alld,,A,eR with A, + A, = land 4, 4, = 0. 
When x, and x, are distinct points from R‘, then the set 


[x1,X2] = {Ayx, + AQxX2|A1,42 = 9, Ay tA, = 13 


is called the closed segment between x, and x,. Half-open segments |x,, x2], 
[x,,x.[ and open segments |x,, x,[ are defined analogously. With this 
notation, a set C is convex if and only if the closed segment between any two 
points of C is contained in C. 

The affine subspaces of R‘4, including R* and @, are convex. Any (closed 
or open) halfspace is convex. 

The image of a convex set under an affine mapping is gain convex. In 
particular, translates of convex sets are again convex. 

By a convex combination of points x,,..., X, from R* we mean a linear 
combination A,x, +--:+4,x,, where A, +--- +A, = land ,,...,4, = 
0. Every convex combination is also an affine combination. We shall write 


to indicate that the linear combination J,x, + --- + A, x, 1S in fact a convex 
combination. The definition of a convex set expresses that any convex 
combination of two points from the set is again in the set. We actually have: 


Theorem 2.1. A subset C of R* is convex if and only if any convex combination 
of points from C is again in C. 


Proor. If any convex combination of points from C is again in C, then, in 
particular, any convex combination of two points from C 1s in C. Therefore, 
C is convex. 

Conversely, assume that C is convex. We shall prove by induction on n 
that any point from R¢ which is a convex combination of n points from C 
is again in C. For n = 1 this is trivial, and for n = 2 it follows by definition. 
So, let n be at least 3, assume that any convex combination of fewer than n 
points from C is in C, and let 


12 1. Convex Sets 


be a convex combination of n points x,,..., x, from C. If A; = 0 for some i, 
then x is in fact a convex combination of fewer than n points from C, and so x 
belongs to C by hypothesis. If A; 4 0 for all i, then A; < 1 for all i, whence, 
in particular, 1 — A, > 0. Therefore, we may write 


os 

| 
M4 
=o 
Ra’ 


= Aix + SAX; 


i=2 


=A,x,+(U - ayy Xj 


Here 


is in fact a convex combination since 4, + --- +4, =1—4,, and so y 
is in C by hypothesis. By the convexity of C then 4,x, + (1 — 4,)y 1s also 
in C, i.e. x is in C. LJ 


It is clear that the intersection of any family of convex sets in R® is again 
convex. Therefore, for any subset M of R¢ there is a smallest convex set 
containing M, namely, the intersection of all convex sets in R* containing M. 
This convex set is called the convex set spanned by M, or the convex hull of 
M, and it is denoted by conv M. 

It is clear that conv(x + M) = x + convM for any point x and any set 
M. More generally, it follows from Theorem 2.2 below that conv(@(M)) = 
(conv M) when @ is an affine mapping. 

We have the following description of the convex hull of a set: 


Theorem 2.2. For any subset M of R*, the convex hull conv M is the set of 
all convex combinations of points from M. 


Proor. Let C denote the set of all convex combinations of points from M. 
Since M c conv M, each x € C is also a convex combination of points from 
the convex set conv M; the “only if” part of Theorem 2.1 then shows that 
C < conv M. To prove the opposite inclusion, it suffices to show that C 
is a convex set containing M. Since each x € M has the trivial representation 
x = 1x as a convex combination of points from M, it follows that McC. 
To see that A,x, + A,x, is in C for each x,,x,¢€C and each 4,,A, > 0 
with A, + A, = 1, note that by definition x, and x, are convex combinations 
of points from M, say 


= YS Midis X27 = yi Li Yi- 


i=1 t=n+1 
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But then 
AX + A2X2 = Y Amy + XY Amv 
and 
Ain; = 9, Aru; = 9, YA + Da =1. 
This shows that A,x, + A, x, is aconvex combination of the points y,,..., Vm 
from M, whence A,x, + A,x, is in C, as desired. LJ 


Up to now we have had complete analogy with Section 1. The concept 
of a basis of a linear or affine subspace, however, has no analogue for convex 
sets in general. Still, we have the following substitute: 


Theorem 2.3. For any subset M of R*, the convex hull conv M is the set of 
all convex combinations 


» AiX; 
i=1 
such that (x,,..., X,) is an affinely independent family of points from M. 


In other words, in order to generate conv M we need not take all convex 
combinations of points from M as described by Theorem 2.2; it suffices to 
take those formed by the affinely independent families of points from M. 
On the other hand, no fixed family of points from M will suffice, as in the case 
of span M or aff M, cf. (d) and (d’) of Section 1. 


Proor. We shall prove that if a point x is a convex combination of n points 
X1,...,X, Such that (x,,...,x,) is affinely dependent, then x is already a 
convex combination of n — 1 of the points x,,..., x,. Repeating this argu- 
ment, if necessary, it follows that there is an affinely independent subfamily 
(Xj,,---,%;,) Of (4,...,x,) such that x is a convex combination of 
Xj,,---+,X;,- The statement then follows from Theorem 2.2. 

So, suppose that we have 


n 


x= y< A; Xi» (1) 
i=1 
where (x,,..., X,) is affinely dependent. If some J; is 0, then x is already a 
convex combination of n — 1 of the points x,,...,x,. Hence, we may 
assume that all A; are >0. The affine dependence means that there are reals 
L44,---5 Hy, not all 0, such that 


y HX; = O, y LH, = 0. (2) 
i=1 
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Combining (1) and (2) we see that for any real t we have 


x= d (A; — tyi)x; (3) 


and 
» A; — tu) = 1. 
i=1 


We now simply seek a value of t (in fact, a positive value) such that A; — tu; = 
0 for alli, and A; — tu; = 0 for at least one i; then (3) will be a representation 
of x as a convex combination of n — 1 of the points x,,...,x,. We have 
A, — tu; > 0 for any t > 0 when yp; < 0. When pi; > 0, we have 4; — tu; = 0 
provided that t < A,/u;, with 2; — tu; = 0 if and only if t = A,/u;. Noting 
that we must have py; > 0 for at least one i, we see that 


t:=min{A;/u;|u; > 9} 


fulfils the requirements. L] 
The following two corollaries are both known as Carathéodory’s Theorem: 


Corollary 2.4. For any subset M of R¢ with dim(aff M) = n, the convex hull 
conv M is the set of all convex combinations of at most n + 1 points from M. 


Proor. For any affinely independent m-family (x,,..., X,,) of points from M, 
we have m <n+ 1 by the assumption. Therefore, the set of all convex 
combinations of n + 1 or fewer points from M contains conv M by Theorem 
2.3. On the other hand, it is contained in conv M by Theorem 2.2. LJ 


Corollary 2.5. For any subset M of R¢ with dim(aff M) = n, the convex hull 
conv M is the set of all convex combinations of precisely n + 1 points from M. 


PROOF. In a convex combination one may always add terms of the form Ox. 
Therefore, the statement follows from Corollary 2.4. a 


By a convex polytope, or simply a polytope, we mean a set which is the 
convex hull of a non-empty finite set {x,,..., x,}. If P is a polytope, then any 
translate x + P of P is also a polytope; this follows from the fact that x + 
conv M = conv(x + M). More generally, the image of a polytope under an 
affine mapping is again a polytope; this follows from the fact that 
~(conv M) = conv g(M) when ¢g is an affine mapping. 

A polytope S with the property that there exists an affinely independent 
family (x,,...,X,) such that S = conv{x,,..., X,} 18 called a simplex (and 
the points x,,..., xX, are called the vertices of S; cf. the remarks following 
Theorem 7.1). 

One might say that simplices have a “convex basis,” cf. the remark 
preceding Theorem 2.3. In fact, if x,,..., x, are the vertices of a simplex S, 
then by the affine independence each point in aff{x,,...,x,} has a unique 
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representation as an affine combination of x,,..., x,, whence, in particular, 
each point in conv{x,,...,x,} has a unique representation as a convex 
combination of x,,..., x,, cf. Theorem 2.3. 


Convex sets having a “convex basis” in the sense described above must, 
of course, be polytopes. The following theorem shows that simplices are the 
only polytopes having a “convex basis”: 


Theorem 2.6. Let M = {x,,..., x,} be a finite set of n points from R* such 


that the n-family (x,,...,X,) is affinely dependent. Then there are subsets 
M, and M, of M with M, 0M, = ©@ and M, U M, = M such that 


conv M, n conv M, # @. 


PROOF. By the affine dependence there are reals 1,,..., ,, not all 0, such that 


y A; X; = O, » A; = (). (4) 


Denoting the set {1,...,n} by J, we let 
I,:= {iel|A,; > 0}, I,:= {iel|A; < 0}, 


and we let 
M, = {x,|ie Ty}, Mz = ix; {ie 14}. 
Now, take 
J. 
X= y —X;, (5) 
tel, A 
where 


A= VA. 


lely 
(It is clear that 1, # @, whence A > 0.) The right-hand side of (5) is in fact a 


convex combination, whence x is in conv M, by Theorem 2.2. However, 
using (4) we see that we also have 


U 
x=) a Xin 
tel2 


and again we actually have a convex combination. Therefore, x is also in 
conv M,. Consequently, conv M, and conv M, have the point x in common. 


LO 


The following corollary of Theorem 2.6 is known as Radon’s Theorem: 


Corollary 2.7. Let M = {x,,...,x,} be a finite set of n points from R¢ such 
thatn > d + 2. Then there are subsets M, and M, of M withM, 0M, = @ 
and M, U M, = M such that 


conv M, n conv M, # @. 
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ProoF. The maximum number of members in an affinely independent 
family of points from R* is d + 1. Therefore, (x,,...,x,) must be affinely 
dependent, whence Theorem 2.6 applies. a 


We conclude this section with an important application of Carathéodory’s 
Theorem. 


Theorem 2.8. For any compact subset M of R‘%, the convex hull conv M is 
again compact. 


PROOF. Let (y,),-. be any sequence of points from conv M. We shall prove 
that the sequence admits a subsequence which converges to a point in conv M. 
Let the dimension of aff M be denoted by n. Then Corollary 2.5 shows that 
each y, in the sequence has a representation 


n+1 


y= yi AyiXvyis 


i=1 


where x,;€ M. We now consider the n + 1 sequences 


(Xv vers v9 (Xvint tyven (6) 
of points from M, and the n + 1 sequences 
Av ven> se Cayint 1) ven (7) 


of real numbers from [0, 1]. By the compactness of M there is a subsequence of 
(X,1)yen Which converges to a point in M. Replace all 2(n + 1) sequences 
by the corresponding subsequences. Change notation such that (6) and (7) 
now denote the subsequences; then (x,,),-qy converges in M. Next, use the 
compactness of M again to see that there is a subsequence of the (sub)sequence 
(X\2)yen Which converges to a point in M. Change notation, etc. Then after 
2(n + 1) steps, where we use the compactness of M in step 1,...,n + 1, 
and the compactness of [0,1] in step n + 2,...,2n + 2, we end up with 
subsequences 


(Xy,meN> sy (Xy nt iymen 
of the original sequences (6) which converge in M, say 


lim x 


m— co 


= XoQi>s i=l,...,nt+1, 


Vmil 
and subsequences 


(Ay men> ne) (Ay n+ 1))meN 
of the original sequences (7) which converge in [0, 1], say 


lim A, = Aoi i=1l,...,n4+1. 


Since 
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we also have 


Then the linear combination 


is in fact a convex combination. Therefore, yp 1s in conv M by Theorem 2.2. 
It is also clear that 
lim Vom => Yo: 


m—> © 
In conclusion, (yy, )mery 1s a subsequence of (y,),<¢. Which converges to a point 
in conv M. = 


Some readers may prefer the following version of the proof above. With 
n = dim(aff M) as above, let 


S:= {(Aq,- Ane DER AL... Anat = 0, A, + vs + Ant] = 1}, 
and define a mapping gy: M"*! x S > R¢ by 


n+1 


P((X4,---5Xntads (Aq, -- +5 Ang a) = » A;X;. 


i=1 


By Corollary 2.5, the set p(M"*! x S) is precisely conv M. Now, M"*! x S 
is compact by the compactness of M and S, and @ is continuous. Since the 
continuous image of a compact set is again compact, it follows that conv M 
is compact. 

Since any finite set is compact, Theorem 2.8 immediately implies: 


Corollary 2.9. Any convex polytope P in R‘ is a compact set. 


One should observe, however, that a direct proof of Corollary 2.9 does not 
require Carathéodory’s Theorem. In fact, if M is the finite set {x,,..., x,,}, 
then each y, (in the notation of the proof above) has a representation 


m 
y= » AyiX;. 
i=1 
Then we have a similar situation as in the proof above (with m corresponding 
ton + 1), except that now the sequences corresponding to the sequences (6) 


are constant, x,; = x; for all v. Therefore, we need only show here that the 
sequences (7) admit converging subsequences (which is proved as above). 


EXERCISES 
2.1. Show that when C, and C, are convex sets in R“, then the set 
C, + Cy := {x, + x,|x,EC,,x,€C,} 


is also convex. 
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2.2. 


2.3. 


2.4. 


2.5. 


2.6. 


2.7. 


2.8. 


2.9. 


2.10. 


2.11. 


1. Convex Sets 


Show that when C is a convex set in R“%, and J is a real, then the set 
AC := {Ax|x EC} 
is also convex. 


Show that when C is a convex set in R*, and g: R* > R° is an affine mapping, then 
@(C) is also convex. 


Show that conv(M, + M,) = conv M, + conv M, for any subsets M, and M, 
of R*. 


Show that when M is any subset of R*, and g: R* > R¢ is an affine mapping, then 
g(conv M) = conv g(M). Deduce in particular that the affine image of a polytope 
is again a polytope. 


Show that when M is an open subset of R*%, then conv M is also open. Use this fact 
to show that the interior of a convex set is again convex. (Cf. Theorem 3.4(b).) 


Show by an example in R* that the convex hull of a closed set need not be closed. 
(Cf. Theorem 2.8.) 


An n-family (x,,..., X,) of points from R¢ is said to be convexly independent if no 
x; 1n the family is a convex combination of the remaining x,’s. Forn > d + 2, show 
that if every (d + 2)-subfamily of (x,,..., x,) iS convexly independent, then the 
entire n-family is convexly independent. 


Let (C,);., be a family of convex sets in R¢ with d + 1 < card J. Consider the 
following two statements: 


(a) Any d + 1 of the sets C; have a non-empty intersection. 
(b) All the sets C; have a non-empty intersection. 


Prove Helly’s Theorem: If card I < o, then (a) = (b). (Hint: Use induction 
on n:= card J. Apply Corollary 2.7.) 

Show by an example that we need not have (a) = (b) when card I = oo. 

Prove that if each C; is closed, and at least one is compact, then we have 
(a) = (b) without restriction on card I. 


Let a point x in R* be a convex combination of points x,,..., x,, and let each x; 
be a convex combination of points yj;;, ..., Yi,,- Show that x is a convex combina- 
tion of the points y,,,,i = 1,...,a,v, = 1,..., Nn). 


Let (C,);.; be a family of distinct convex sets in R*. Show that 


conv (J C, 


ie] 


is the set of all convex combinations 


where x, € C;,. 
Deduce in particular that when C, and C, are convex, then conv(C, U C,) 1s 
the union of all segments [x,, x,] with x, EC, and x,€C;j. 
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§3. The Relative Interior of a Convex Set 


It is clear that the interior of a convex set may be empty. A triangle in R°, 
for example, has no interior points. However, it does have interior points in 
the 2-dimensional affine space that it spans. This observation illustrates the 
definition below of the relative interior of a convex set, and the main result of 
this section, Theorem 3.1. We shall also discuss the behaviour of a convex 
set under the operations of forming (relative) interior, closure, and boundary. 


By the relative interior of a convex set C in R* we mean the interior of C 
in the affine hull aff C of C. The relative interior of C is denoted by riC. 
Points in r1C are called relative interior points of C. The set cl C \ ri C 1s 
called the relative boundary of C, and is denoted by rbC. Points in rb C 
are called relative boundary points of C. (Since aff C is a closed subset of 
R‘, the “relative closure” of C is simply the closure of C. Hence, the relative 
boundary of C is actually the boundary of C in aff C.) 

It should be noted that the ri-operation is not just a slight modification 
of the int-operation. Most striking, perhaps, is the fact that the ri-operation 
does not preserve inclusions. For example, let C, be a side of a triangle 
C,. Then C, < C,, whereas ri C, Z ri C,; in fact, riC, and ri C, are non- 
empty disjoint sets. 

By the dimension of a convex set C we mean the dimension dim(aff C) 
of the affine hull of C; it 1s denoted by dim C. The empty set has dimension 
—1. The 0-dimensional convex sets are the 1-point sets {x}. The 1-dimen- 
sional convex sets are the (closed, half-open or open) segments, the (closed 
or open) halflines, and the lines. 

For a O-dimensional convex set C = {x}, we clearly have riC = C, 
clC = C,and rbC = @. 

We have ri C = int C for a non-empty convex set C in R“ if and only if 
int C # @. In fact, if intC #4 @ then aff C = R“% whence ri C = int C by 
the definition of ri C. The converse is a consequence of the following: 


Theorem 3.1. Let C be any non-empty convex set in R*. Then riC #4 @&. 
We first prove Theorem 3.1 for simplices: 


Lemma 3.2. Let S be a simplex in R*. Then ri S # @. 


Proor. When dim S = k, there is a (k + 1)-family (x,,..., X,+ 1), affinely 
independent, such that 


S = conv{x,, eee Xpar}: 
Then (x,,..., X,4,) 1S an affine basis of aff S; hence, aff S is the set of points 
of the form 
k+1 


x= Y71;x;, 
i=1 
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and for each x € aff S, the coefficients A,,...,A,4, are unique. Therefore, 
we may define a mapping 


o.aff S—> Rt? 
by letting 


k+1 
o( » i) == (A,,..., Ag+). 
i=1 


This is actually an affine mapping; in particular, it is continuous. Let 
K, = {(Ay, 0s Aga) € RelA; > OF i=1,...,k4+ 1. 


Then K,,..., K,4, are open halfspaces in «+! and therefore, by continuity, 
the sets o~'(K,),..-,@ '(Kz+1) are open (in fact, open halfspaces) in 
aff S. The set 


k+1 


(9° Ko, (1) 


is therefore also open in aff S. Now, note that 


k+1 k+1 
(| gp *(K)) = > Aix; 
i=1 i=1 


This shows in particular that the set (1) is non-empty. And since affine 
combinations A,x, + -°:° + Aya iXe+1 With all A; > 0 are convex combina- 
tions, we see that the set (1) is a subset of S. In other words, the set S contains a 
non-empty set which is open in aff S, whence ri S # ©. (The proof shows 
that the set (1) is a subset of ri S. Actually, the two sets are the same.) LJ 


dasseesdans > Of 


With Lemma 3.2 at hand we can now pass to: 
PrRooF (Theorem 3.1). Let 
k := dim C (= dim(aff C)). 


Then there is an affinely independent (k + 1)-family (X1,--+5 Xe¢1) Of 
points from C (but no such (k + 2)-family). Let 


S — conv{x,, cg Xe+ i} 


Then S is a simplex contained in C. By Lemma 3.2, S has a non-empty 
interior relative to aff S. Since 


aff S < aff C 
and 
dim(aff S) = k = dim(aff C), 
we actually have 


aff S = aff C. 
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Therefore, S has a non-empty interior relative to aff C. But since S is a subset 
of C, it follows that C has a non-empty interior relative to aff C, as desired. 


LC] 


The following theorem shows that any point in the closure of a convex set 
C can be “seen” from any relative interior point of C “via” relative interior 
points: 


Theorem 3.3. Let C be a convex set in R*. Then for any x) €riC and any 
x, €cl C with xo # x, we have [x9,X,[ < riC. 


PROOF. It is easy to prove the statement in the particular case where we have 
X)>eEintC and x,eéEC. For Ae JO, 1[, let x,:=(1 — A)xo + Ax,. From 
Xo € int C it follows that there is a ball B centred at x) with B c C. From 
x, €C and the convexity of C it next follows that 


B, = (1 _ A)B + AX 


is contained in C. But B, is a ball centred at x,, whence x, € int C, as desired. 
The proof below covering the general case is an elaborate version of this 
idea. Of course, the main difficulty is that x, need not be in C. 

So, consider x9 € ri C and x, ecl C with xy # x,. For any Ae JO, 1[, let 


X) = (i — A)X 0 + AX}. 


We shall prove that x, eri C. Since xq is a relative interior point of C, 
there is a (relatively) open subset U of aff C such that x,» ¢ U < C. Let 


V:=A7"(x, — (1 — AU). 
Since 
A~+ ~ 2-41 — A) = 1, 


it follows that V is a subset of aff C, and it is, in fact, (relatively) open. And 
since 
x, =A*'(x,-(1 - A)xo), 


we see that x, € V. Therefore, by the assumption that x, €cl C, there is a 
point y, EVC. Let 


W:= (1 — AU + Ay. 


Then W 1s a (relatively) open subset of aff C, and since we have both U < C 
and y, €C, it follows that W c C by the convexity of C. We complete the 
proof by showing that x, € W. From the definition of V it follows that there 
iS a point yo € U such that 


y =a \(x,-(- A)Yo): 
Then 
x, = (1 — Ayo + AY, 
eal —AU+ Ay, =W, 
as desired. CJ 
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Theorem 3.3 is a useful tool. Among other things, it is crucial for the proofs 
of all the statements, except (a), in the following theorem. The theorem 
brings out the nice behaviour of convex sets. 


Theorem 3.4. For any convex set C in R¢ one has: 


(a) cl C is convex. 

(b) ri C is convex. 

(c) cl C = ch(cl C) = cl(ri C). 

(d) nC = ri(cl C) = ri(i C). 

(e) rb C = rb(cl C) = rb(m C). 

(f) aff C = aff(cl C) = aff(nC). 

(g) dim C = dim(cl C) = dim(ri C). 


PROOF. For C = @, there is nothing to prove. So, we may assume that C 
is non-empty, whenever necessary. 
(a) Let Xo, x, E€cl C, and let Ae j0, 1[. We shall prove that the point 


Xy = (i —_ A)Xo + AX 1 
is also in cl C. Now, there are sequences 

(Xowve No (Xiyve N 
of points from C such that 


lim Xo, = Xo, lim x, = X,. 


By the convexity of C, the points 
(1 — A)xo, + AX4,, veEN, 
are all in C. Furthermore, 


lim (1 —_ A)Xoy + AX 1.) = (1 “— A)X + AX, = Xj. 


vo oe 


This shows that x, ecl C. 
(b) We shall prove that for any x9, x, Eri C and any Ae 0, 1[, the point 


xX) = (1 —_ A)Xo + AX} 


is also in ri C. This follows immediately from Theorem 3.3. 

(c) The statement cl C = cl(cl C) is trivial. It is also trivial that cl(r1 C) < 
cl C. To prove the opposite inclusion, let x, € cl C. Take any point x9 € ri C, 
cf. Theorem 3.1. If x) = x,, then we have 


x,EenC cock C), 
as desired. If xp # x,, then we have 


[xXo, x4[ Cc rl C, 


$3. The Relative Interior of a Convex Set 23 


cf. Theorem 3.3. Since each neighbourhood of x, contains points from 
[xX 9,X,[, 1t follows that x, is in cl(ri C). 
(d) To prove that ri C = ri(cl C), we first note that 


aff C = aff(cl C), (2) 


since aff C is closed. Then it is clear that ri C < ri(cl C). To prove the opposite 
inclusion, let x be in ri(cl C). Take any point x) eriC, cf. Theorem 3.1. If 
Xo = X, then we have xeriC, as desired. If x9 # x, then aff{xo,x} is a 
line, and we have 


aff {xo, x} < aff(cl C) = aff C. 


Since x eri(cl C), there is a point x, €aff{x,,x} such that x,eclC and 
x € |x), x,[. Application of Theorem 3.3 then yields x € ri C. Hence, riC = 
ri(cl C). 

To prove that ri C = ri(ri C), we first verify that 


aff C = aff(ri C). (3) 
Applying (2) to ri C instead of C and using (c), we obtain 
aff(ri C) = aff(cl(ri C)) 

= aff(cl C) 

= aff C. 
Now, using the notation int,-;- C for ri C, we have 

= 1Ntare (tI C), 
where we have used (3). But 
INtare c(t] C) = iNtyre cCUUNtare c C) 
=> INtserc C 
=r1C, 


where we have used the standard fact that int(int M) = int M for any set M. 
This completes the proof of (d). 
(e) By definition we have 


mbC=clC\ric, 
rb(cl C) = cl(cl C) \ ri(cl C), 
rb(ri C) = cl(ri C) \ ri(ri C). 


The statement then follows using (c) and (d). 
(f) This has already been proved, cf. (2) and (3) above. 
(g) This follows from (f). LJ 
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The next theorem also depends on Theorem 3.3. It shows that the relative 
interior points of a convex set C may be characterized in purely algebraic 
terms: 


Theorem 3.5. For any convex set C in R* and any point x €C the following 
three conditions are equivalent: 


(a) xeric. 
(b) For any line A in aff C with x EA there are points yo, yy; € AAC such 


that x € ]yo, yiL. 
(c) For any point ye C with y ¥ x there is a point zEC such that x € |y, z[, 


i.e. any segment [y, x] in C can be extended beyond x in C. 


Proof. The implications (a) = (b) and (b) = (c) are obvious. Therefore, 
we need only prove (c) = (a). By Theorem 3.1 there is a point yeriC. If 
y = x, there is nothing more to prove. If y # x, then by (c) there is a point 
z€C such that x € Jy, z[. But then x is in ri C by Theorem 3.3. LI 


We conclude this section with an application of Theorem 3.4(a). Let M 
be any set in R*. Then there is a smallest closed convex set containing M, 
namely, the intersection of all closed convex sets containing M. We call this 
set the closed convex hull of M, and denote it by clconv M. As might be 
expected, we have: 

Theorem 3.6. Let M be any subset of R*. Then 
clconv M = cl(conv M), 
i.e. the closed convex hull of M is the closure of the convex hull of M. 


PRrooF. Using Theorem 3.4(a) we see that cl(conv M) is a closed convex set 
containing M. Since clconv M is the smallest such set, it follows that 


clconv M « cl(conv M). 
On the other hand, clconv M is a convex set containing M, whence 
cleconv M > conv M. 
Since clconv M 1s also closed, this implies 


cleonv M > cl(conv M), 
completing the proof. LJ 


EXERCISES 


3.1. Let P = conv{x,,..., X,} be a polytope in R*. Show that a point x is in ri P if and 
only if x is a convex combination of x,,..., x, with strictly positive coefficients, 1.e. 
there are 4,,..., 4, Such that 


n 
x= \° Ax; 
i=1 


and J, > Ofori =1,..., n. 
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3.2. Let C, and C, be convex sets in R*. Show that ri(C, + C,) = ri C, + riC,. 


3.3. Let C be a convex set in R*, and let g: R¢ > R® be an affine mapping. Show that 
ri o(C) = (ri C). 


3.4. Let (C;);<; be a family of convex sets in R? such that 


(\ ri C; # S. (4) 
iel 
Show that 
cl (\ C; = (\ el C;. (5) 
iel iel 


Show that if (4) does not hold, then (5) need not hold. 


3.5. Let (C;);=1....., be a finite family of convex sets in R4 such that 
q iC,  &. (6) 
Show that 
ri VC, = an C;. (7) 
i=1 i=1 


Show that if (6) does not hold, then (7) need not hold. 


§4. Supporting Hyperplanes and Halfspaces 


It is intuitively clear that when x is a relative boundary point of a convex set C, 
then there is a hyperplane H passing through x such that all points of C not in 
H are on the same side of H. One of the main results of this section shows that 
it is in fact so. 


Let C be a non-empty closed convex set in R*. By a supporting halfspace of C 
we mean a Closed halfspace K in R“ such that C c K and HAC ¥¢ @, where 
H denotes the bounding hyperplane of K. By a supporting hyperplane of C we 
mean a hyperplane H in R* which bounds a supporting halfspace. 

In the definition of a supporting hyperplane H of C we allow Cc H 
(in which case both closed halfspaces bounded by H are supporting halfspaces). 
If C is not contained in H we shall call H a proper supporting hyperplane. 

Analytically, a hyperplane H(y, «) is a supporting hyperplane of a non- 
empty closed convex set C if and only if 


ao = max <x, y> (1) 
xeC 
or 
a = min <x, y». (2) 


xeCc 
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If (2) holds for H(y, «), then (1) holds for H(—y, —«). Since H(—y, —a) = 
H(y, «), it follows that any supporting hyperplane H of C has the form H(y, «) 
such that (1) holds, whence C < K(y, a). Note, by the way, that if H(y, «) is a 
supporting hyperplane such that C c K(y, «), then H(y, a) is proper if and 
only if 

inf <x, y> < max <x, y>. 


xe€ xe 


We first prove: 


Theorem 4.1. Let C be anon-empty convex set in R‘, and let H be a hyperplane 
in R*. Then the following two conditions are equivalent: 


(a) HnriC= ©. 
(b) Cis contained in one of the two closed halfspaces bounded by H, but not in H. 


Proor. Assume that (a) holds. Let x, eri C, cf. Theorem 3.1. Then x ¢ H by 
(a). In particular, C is not contained in H. Suppose that there is a point x; € C 
such that x, and x, are on opposite sides of H. Then there are y and a such 
that H = H(y, «) and 


(X05 y> <a< (X45 y>. 
Taking 
_ a — {X05 y> 
(X1, YP — «Xo, YW? 
and 
x,:= (1 — A)xg + AX, 
we have A e€ ]0, I[, and so x, € |xg, x,[. Furthermore, an easy computation 
shows that <x,, y> = a, whence x, € H. On the other hand, since x) € 11 C and 
x, € ]Xo, X;[, it follows from Theorem 3.3 that we also have x, € ri C, whence 
x,€H riC, a contradiction. In conclusion, C is contained in that closed 
halfspace bounded by H which contains the point xo. 
Conversely, assume that (b) holds. Suppose that there is a point 
xé€H ari C. By (b) there is a point ye C\H. Then by Theorem 3.5, (a) = (c) 
there is a point ze C such that xe Jy, zl, whence 


x=(1-—A)y+ Az 
for a suitable A € ]0, 1[. Now, there are u and « such that H = H(u, a) and 
C c K(u, a). Then <y, u> < a and <z, u> < a, whence 
<x, u» = <6 _ A)y + Az, u» 
= @! ~ A)<y, u» + AXZ, u» 
<(1—A)ja+ ha=a. 


At the same time we have <x, u» = a since x € H, a contradiction. Therefore, 
H «ri C is empty. L} 
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We immediately get: 


Corollary 4.2. A supporting hyperplane H of anon-empty closed convex set C in 
IR‘ is a proper supporting hyperplane of C if and only if H A riC = ©. 


The following result is fundamental: 


Theorem 4.3. Let C be aclosed convex set in R4, and let x be a point inrb C. Then 
there is a proper supporting hyperplane H of C such that x € H. 


We shall build the proof of Theorem 4.3 upon the following: 


Lemma 4.4. Let C be a non-empty open convex set in R‘, and let x be a point 
of R* not in C. Then there is a hyperplane H in R* such that x €H and 
HnC= @. 


PROOF. We shall use induction on d. The statement is trivially true ford = 0, 1. 
We also need a proof for d = 2, however. So, let C be a non-empty open convex 
set in R*, and let x € R*\C. We shall prove that there exists a line L in R? such 
thatxe Land LAC = ©. Let S bea circle with its centre at x, and for each 
point ue€ C let u’ be the unique point of S where the halfline 


{(1 — A)x + Au|A > O} 
from x through u meets S. Then the set 
C':= {u'|ueC} 


is an open arc in S. Since x ¢ C and C is convex, two opposite points of S can- 
not both be in C’. Therefore, the angle between the two halflines from x 
through the endpoints of C’ is at most nm. Any of the two lines determined by 
one of these halflines can then be used as L. (If the angle is n, then, of course, L 
is unique.) 

Next, let d > 2, and assume the statement is valid for all dimensions less 
than d. Let C be a non-empty open convex set in R*, and let x € R“\C. (See 
Figure | which illustrates the “difficult” situation where x € cl C.) Take any 
2-dimensional affine subspace A of R* such that x € A and AC ¥ @. Then 
A ~ Cisanon-empty open convex set in A with x ¢ A A C. Identifying A with 
R* and using the result on R? proved above, we see that there exists a line L 
in A such that xe L and 


LA(ANC)= LAC = @. 


Let B be any hyperplane in R* orthogonal to L, and let z: R? — B denote the 

orthogonal projection. Then z(C) is a non-empty open convex set in B. More- 

over, since 2 *(m(x)) = L, we see that m(x) ¢ x(C). Then, by hypothesis, there 

is a hyperplane H’ in B such that 2(x) € H’ and H’ a n(C) = @. But then 
H := aff(H’ U L) = 27 '(f’) 


is a hyperplane in R¢ withxe H and HOC = @. C 
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Figure | 


We can now prove Theorem 4.3: 


Proor (Theorem 4.3). When dim C = ~—1, 0, there is nothing to prove. So, let 
dim C > 1, and let C and x be as described. We shall apply Lemma 4.4 to the 
convex set ri C and the point x in the affine space aff C. (Here we need to 
identify aff C with R° where e := dim(aff C).) To see that Lemma 4.4 applies, 
note that ri C is non-empty by Theorem 3.1, convex by Theorem 3.4(b), and 
open in aff C; furthermore, x is in aff C. Application of Lemma 4.4 then yields 
the existence of a hyperplane H’ in aff C such that xe H’ and H’ornC = @. 
Clearly there is a hyperplane H in R* such that H 1 aff C = H’. (If already 
aff C = R*, then H = H’.) Then we also have xe H and HnriC = @. 
Theorem 4.1, (a) > (b) finally shows that H is in fact a proper supporting 


hyperplane. O 
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The following theorem is also fundamental: 


Theorem 4.5. Let C be anon-empty closed convex set in R*. Then C is the inter- 
section of its supporting halfspaces. 


Proor. When dim C = 0, the theorem is clearly true. When C = R’, there 
are no supporting halfspaces; hence, the theorem is also true in this case. 
So, let dim C > 1, and let x be a point of R* outside C; we shall prove that 
there is a supporting halfspace K of C such that x ¢ K. If x ¢ aff C, there is a 
hyperplane H in R¢ with aff C < H and x ¢ H. The closed halfspace bounded 
by H which does not contain x then has the desired property. If x € aff C, let z 
be a relative interior point of C, cf. Theorem 3.1. Then [z, x] 7 C is a closed 
segment [z, u], where u erb C and [z, uf is in ri C, cf. Theorem 3.3. Now, by 
Theorem 4.3 there is a proper supporting hyperplane H of C such that ue H. 
The supporting halfspace K bounded by H then has the desired property. In 
fact, suppose that x € K. As we have z Eri C, it follows from Corollary 4.2 that 
z ¢ H, whence zeint K. But then Theorem 3.3 shows that ]z, x[ is in int K, 
which is contradicted by the fact that the point u belonging to ]z, x[ is in 
H = bd K. = 


One may say that Theorem 4.5 describes an “external representation” ofa 
closed convex set. In the next section we shall meet an “internal representation” 
of a compact convex set. 


EXERCISES 


4.1. Let C, and C, be convex sets in R*. A hyperplane H in R’* is said to separate C, and 
C, if C, is contained in one of the two closed halfspaces bounded by H and C, is 
contained in the other closed halfspace bounded by H. Note that we allow C, < H 
and C, < H. If at least one of the two sets C, and C, is not contained in H, then H 
is said to separate properly. Show that there exists a hyperplane H separating C, 
and C, properly if and only if riC, a ri C, = @. (Hint: Consider the convex set 
C:=C, — C,. Use Exercise 3.2.) 


4.2. Let C,; and C, be convex sets in R*. A hyperplane H(y, «) is said to separate C, 
and C, strongly if for some ¢ > 0 both H(y, a — «) and H(y, a + €) separate C, 
and C,, cf. Exercise 4.1. Show that there exists a hyperplane H separating C, and 
C, strongly if and only if o ¢ cl(C, — C,). Deduce, in particular, that if C, and C, 
are disjoint closed convex sets one of which is compact, then there is a strongly 
separating hyperplane. 


§5. The Facial Structure of a Closed Convex Set 


In this section we shall study certain “extreme” convex subsets of a closed 
convex set C, called the faces of C. We shall prove, among other things, that 
when the set C is compact, then it is the convex hull of its 0-dimensional faces. 
This is the “internal representation” mentioned in Section 4. 
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In the following, let C be a closed convex set in R*. A convex subset F of C 
is called a face of C if for any two distinct points y, z € C such that ]y, z[ A F is 
non-empty, we actually have [ y,z] < F. Note that in order to have[y,z] c F 
it suffices by the convexity of F to have y, ze F. 

The subsets @ and C of C are both faces of C, called the improper faces; all 
other faces are called proper faces. 

A point x € C 1s called an extreme point of C if {x} 1s a face. This means, by 
definition, that x is not a relative interior point of any segment [y, z] in C, 
or, equivalently, that C\ {x} 1s again convex. The set of extreme points of C is 
denoted by ext C. 

A face F of Cis called a k-face ifdim F = k. Thus, the 0-faces are the extreme 
points. (Strictly speaking, {x} 1s a face if and only if x 1s an extreme point.) A 
facet of C is a face F with O < dim F = dim C — 1. 

It is clear that the intersection of any set .& of faces of C is again a face of C. 
Hence, there is a largest face of C contained in all the members of ./, namely, 
the intersection of all the members of .°. However, we can also conclude that 
there is a smallest face of C containing all the members of .~, namely, the 
intersection of all faces of C containing all the members of .o. (Note that C 
itself is such a face.) Denoting the set of all faces of C by ¥(C), we may express 
this by saying that the partially ordered set (A(C), <) is a complete lattice 
with the lattice operations 


inf J :=() {Fe F(C)|Fe A}, 
sup = (\ {Ge F(C)|VF ea: F< Gh. 


(For lattice-theoretic notions, see Appendix 1.) We shall call (4(C), <) the 
face-lattice of C. (The partially ordered set (¥(C), >) is, of course, also a 
complete lattice. However, when speaking of the face-lattice of C we always 
mean ¥(C) equipped with c.) 

When C is aclosed convex set with dim C > 1, then certain faces of C have 
a particular form: If H is a proper supporting hyperplane of C, cf. Corollary 
4.2, then the set F:= H ~ C 1s a proper face of C. In fact, F is a non-empty 
proper subset of C by definition, and being the intersection of two convex 
sets it is also convex. To see that it has the face property, let y and z be two 
points of C such that ]y, z[ m F is non-empty. Then (1 — 4)y + Az is in H for 
some /A€ ]0, I[. Now, there are u and « such that H = H(u, «) and Cc 
K(u, «). We then have <y,u> < a, <z,u> <aand <(1 — A)y + Az, D =a, 
whence <y, u> = <z,u> = a,1e. yand zarein H, and therefore in F, as desired. 
A face F of C of the form F = H 1 C, where H is a proper supporting hyper- 
plane of C, is called a (proper) exposed face. For any closed convex set C 
(including sets C with dim C = —1, 0) it is convenient also to consider @ and 
C as exposed faces of C; we shall call them improper exposed faces. 

(There is a formal problem in connection with the definition of a proper 
exposed face of C, namely, that it depends on the choice of the particular 
affine space containing C. If C is “initially” lying in R*, we would like to be 
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free to consider it as a subset of any affine subspace A of R* containing 
aff C. We can, however, easily get away with this difficulty, since the hyper- 
planes in A are just the non-empty intersections H 4 A, where H is a hyper- 
plane in R¢ not containing A.) 

A point x € C is called an exposed point of C if {x} is an exposed face. The 
set of exposed points of C is denoted by exp C. Thus, exp C is a subset of ext C. 

The set of exposed faces of C is denoted by &(C). The order-theoretic 
structure of (€(C), <) will be discussed later in this section. 

In order to illustrate the notions introduced above, consider the following 
example. Let C be the convex hull of two disjoint closed discs in R* having the 
same radius. Then the boundary of C consists of two closed segments [x,, x, ] 
and [x3, x,], and two open half-circles. The 1-faces of C are the two segments 
[x1, X2], [x3, x4]; these faces are in fact exposed. The extreme points (i.e. the 
0-faces) are the points x,, x, x3, x4, and the points belonging to one of the 
open half-circles. Clearly, each point belonging to one of the open half-circles 
is even exposed. The extreme points x,, X,, x3, X4 are not exposed however; 
in fact, a supporting hyperplane of C containing one of the points x,, x5, x3, 
x4 must also contain one of the two segments. In particular, this shows that 
in general there are non-exposed faces. 

Any proper exposed face is the intersection of two closed sets, and there- 
fore it is closed itself. We actually have: 


Theorem 5.1. Every face F of a closed convex set C in R? is closed. 


PROOF. For dim F = —1, Othere is nothing to prove. Assume that dim F > 1, 
and let x be any point in cl F. Let xg be a point in ri F, cf. Theorem 3.1. If 
X = Xo, we have x € F as desired. If x # xq, then [xg, x[ is a subset of ri F by 
Theorem 3.3. In particular, ]x9, xl A F # @, whence x is in F by the defini- 
tion of a face. C] 


Theorem 5.1 shows among other things that it makes sense to talk about 
“a face of a face” (of a closed convex set): 


Theorem 5.2. Let F be a face of a closed convex set C in R‘, and let G be a subset 
of F. Then G is a face of C if (and only if) G is a face of F. 


ProoF. It follows immediately from the definition that if the set G is a face of 
C, then it is also a face of F. Conversely, suppose that G is a face of F, and let 
y and z be points of C such that ]y, z[ intersects G. Since G c F, the segment 
ly, z[ also intersects F. This implies y, z € F since F is a face of C. But then we 
also have y, z € G, as desired, since G is a face of F. L] 


One should note that the “if” part of Theorem 5.2 is not valid in general 
with “face” replaced everywhere by “exposed face.” In fact, in the example 
above x, is an exposed point of [x,, x], and [x,, x, ] is an exposed face of C, 
but x, is not an exposed point of C. 
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Theorem 5.3. Let F be a face of a closed convex set C in R* such that F # C. 
Then F crbC. 


Proor. For dim C = —1,0 there is nothing to prove. So, assume that we 
have dim C > 1. Let F bea face of C such that some point x from F isinri C. 
We shall complete the proof by showing that F = C. Let y be an arbitrary 
point in C. If y = x, then y is in F, as desired. If y # x, then there is a point 
z in C such that xe ]y, z[, cf. Theorem 3.5, (a) = (c). Since x is in F, and F 
is a face, it follows that y is in F. a 


Corollary 5.4. Let F and G be faces of a closed convex set C in R* such that 
G&F.ThenG crbF. 


PRooF. First note that G is a face of F, cf. Theorem 5.2. The statement then 
follows immediately from Theorem 5.3. LJ 


Corollary 5.5. Let F and G be faces of a closed convex set C in R* such that 
G & F. Then dim G < dim F. 


PrRooF. First note that we have aff G c aff F since G c F. Suppose that 
aff G = aff F. Thenri G c ri FsinceG c F.Combining with Corollary 5.4 we 
obtain riG = @. By Theorem 3.1 this implies G = @, whence also F = © 
since aff F = aff G, contradicting that G # F by assumption. In conclusion, 
we must have aff G ¢ aff F, whence dim G < dim F. CI 


For any subset M of a closed convex set C in R* there is a smallest face of C 
containing M, namely, the intersection of all faces containing M. Theorem 5.3 
shows that when M contains a point from ri C, then the smallest face con- 
taining M is C itself. 


Theorem 5.6. Let C be a closed convex set in R‘4, let x be a point in C, and let F 
be a face of C containing x. Then F is the smallest face of C containing x if and 
only if x eri F. 


Proor. If x eri F, then F is the smallest face containing x by Corollary 5.4. If 
x erb F, then by Theorem 4.3 there is a face G (in fact, exposed) of F such that 
x€G & F. By Theorem 5.2, G is also a face of C, and therefore F is not the 
smallest face containing x. LJ 


Corollary 5.7. Let C be a closed convex set in R*. Then the sets ri F, where 
Fe F(C)\{O}, form a partition of C. 


PROOF. The statement amounts to saying that for each x € C there is a unique 
face F of C such that x e ri F. However, Theorem 5.6 gives such a unique face, 
namely, the smallest face of C contaiming x. LI 
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Next, we shall study the exposed faces. We first prove: 


Theorem 5.8. Let F be afacet of aclosed convex set C in R*. Then F is an exposed 
face. 


PRroor. By the definition of a facet we necessarily have dim F > 0, whence 
by Theorem 3.1 there is a point x eri F. Then, by Theorem 5.6, F is the 
smallest face of C containing x. On the other hand, Theorem 4.3 shows that 
there is an exposed face G of C such that x € G. It then follows that F < G& C. 
Using Corollary 5.5 we obtain 


dim C — 1 =dimF < dimG < dim C, 


whence dim G = dim F. Corollary 5.5 then shows that F = G, and therefore 
F is exposed. LJ 


At the beginning of this section we noted that the intersection of any set of 
faces of a closed convex set C is again a face of C. The following theorem 
shows that a similar result holds for exposed faces: 


Theorem 5.9. Let {F; |i¢ I} be a set of exposed faces of a closed convex set C in 
Rand let 


F:= ()\ F;. 


ie] 
Then F is also an exposed face of C. 


PRooF. When F is @ or C, there is nothing to prove. So, in the following we may 
assume that F is a non-empty intersection of proper exposed faces F,, i¢ I. 


We shall first consider the case where J is a finite set, say J = {1,..., n}. 
Now, for each ie I there is a hyperplane H(y,, «;) such that 
F; = HY, %) AC (1) 
and 
Cc K(jj, 4%). (2) 


We may assume without loss of generality that o € int C. Then a is interior for 
all the K(y;, «;)’s, and therefore each «; is > 0. Letting 


Zi = OF Yi 
for i = 1,...,n, (1) and (2) become 
F, = A(z;, 1) AC, 
Cc K(z,, 1). 
Let 


34 1. Convex Sets 


Then for any x € C we have 


<x, Zp> = (x y tz) 


= yy ne <x, Zi> 
i= 
< )an'-1=1, (3) 
i=l 
implying that 
Cc K(Zo, 1). 
Furthermore, we have equality in (3) if and only if x € H(z,, 1)fori = 1,...,n. 


This shows that 
H(z), l)DAC = F. 


Hence, F is an exposed face. 
When / is infinite, it suffices by the preceding to prove that there exist 
i,,...,1, € 1 such that 


() F;. = F, 
v=1 
Let i, be any of the ?’s in J. If F = F;,, we have the desired conclusion. If 
F & F,,, then there is i, € J such that 
FCF,OF;,, & F;,. 
From Corollary 5.5 it follows that 
dim(F,, 0 F;,) < dim F;,. 


If F = F;, 0 F,,, we have the desired conclusion. If F & F;, © F;,, then there 


is i; € I such that 
FCF,OF,OF;, & Fi, 0 Fi- 
Again from Corollary 5.5 it follows that 
dim(F;, 0 Fj, 0 Fj,) < dim(F;, 0 F;,). 


IfF = F;, 0 F;, 0 F;,, we have the desired conclusion. If F ¢ F;, 0 F;, 0 F;i,; 
there is i, € I, etc. Since the dimension in each step is lowered by at least 1, we 
must end up with i,,..., i,¢J such that 


as desired. LJ 
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It follows from Theorem 5.9 that the partially ordered set (€(C), <) of 
exposed faces of a closed convex set C in R* is a complete lattice with 


inf SJ :=(\ {Fe é(C)|Fe a} 
sup J = () {GEé(C)|VFEeavx:F <G} 


for WY < &(C). It is interesting to note, however, that in general (6(C), <) is 
not a sublattice (cf. Appendix 1) of (A(C), <). In fact, when 7 is a subset of 
&(C), then sup .< computed in (6(C), <) may be different from sup 
computed in (A(C), <). (The inf-operation, however, is the same in 
(&(C), <)asin(A(C), <).) For example, it is not difficult to construct in R° 
a closed convex set C with the following properties. Among the extreme 
points of C there are three, say x,, x2, x3, such that conv{x,, X2, x3} 1s an 
exposed face, x, and x, are exposed points, but the face [x,, x,] 1s not 
exposed. (See Figure 2.) Then if we consider the subset .o of &(C) consisting 
of the two exposed faces {x,} and {x,}, we see that sup .¥ in (¥(C), <) 
is [x,, X.], whereas sup .W in (&(C), <) is conv{x,, Xz, X3}. 


X2 


Figure 2 


The final theorem of this section deals with extreme points. Closed half- 
spaces and affine subspaces are closed convex sets without extreme points. 
We shall prove that compact convex sets are “spanned” by their extreme 
points. This result is known as Minkowski’s Theorem: 


Theorem 5.10. Let C be a compact convex set in R*, and let M be a subset of C. 
Then the following two conditions are equivalent: 

(a) C=conv M. 

(b) ext Cc M. 


In particular, 
(c) C = conv(ext C). 


PROOF. Suppose that there is an extreme point x of C which is not in M. Then 
M is a subset of C\ {x}, and since C\ {x} 1s convex by the definition of an 
extreme point, it follows that conv M 1s also a subset of C\ {x}. This proves 


(a) = (b). 
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To prove (b) = (a) it suffices to show that 
C c conv(ext C). (4) 


(In fact, suppose that (4) holds. Since the opposite inclusion of (4) 1s obvious, 
it then follows that C = conv(ext C). But then we also have C = conv M for 
any subset M of C containing ext C.) We shall prove (4) by induction on the 
dimension of C. For dim C = —1, Othere is nothing to prove. For dim C = 1 
the statement is clearly valid. Suppose that the statement is valid for all com- 
pact convex sets of dimension <e, where e > 2, and let C be a compact 
convex set of dimension e. Let x be any point in C; we shall prove that x is a 
convex combination of extreme points of C, cf. Theorem 2.2. If x itself is an 
extreme point, there is nothing to prove. If x is not an extreme point, then 
there is a segment in C having x in its relative interior. Extending the segment, 
if necessary, we see that there are in fact points yo, y, Erb C such that 
x € |¥o, y,[. Let Fy and F, be the smallest faces of C containing yp and y,, 
respectively. Then F, and F,, are proper faces of C, cf. Corollary 5.7. They are, 
in particular, compact convex sets, cf. Theorem 5.1, and they both have 
dimension <e, cf. Corollary 5.5. Then, by the induction hypothesis, there are 
points Xo1,...,Xo9, € ext Fy and X,,,...,X,, € ext F, such that yo Is a convex 
combination of the x9;’s and y, is a convex combination of the x, ;’s. Since x is 
a convex combination of yo and y,, it follows that x is a convex combination 
of the xo;'s and x, ,'s. To complete the proof, we note that the xo,’s and x, ;s are 
in fact extreme points of C; this follows from Theorem 5.2. 


Corollary 5.11. Let C be a compact convex set in R‘ withdim C = n. Then each 
point of C is a convex combination of at most n + 1 extreme points of C. 


PRooF. Combine Theorem 5.10(c) and Corollary 2.4. L 


EXERCISES 
5.1. Show that ext C is closed when C is a 2-dimensional compact convex set. 
5.2. Let C be the convex hull of the set of points (o,, «), #3) € R® such that 
,= a4, = 0, a,ée[—1, 1], 
or 
a; = 0, (a, —1)? +03 = 1. 
Show that ext C is non-closed. 


5.3. Let C be a closed convex set in R*. Show that if a convex subset F of C is a face 
of C, then C\F is convex. Show that the converse does not hold in general. 


5.4. Let C be a non-empty closed convex set in R’. An affine subspace A of R’ is said to 
support Cif A A C # @ and C\A is convex. Show that the supporting hyperplanes 
of C in the sense of Section 4 are the hyperplanes that support C in the sense just 
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defined. Show that for a non-empty convex subset F of C, the following three 
conditions are equivalent: 


(a) F is a face of C. 
(b) There is a supporting affine subspace A of C such that AN C = F. 
(c) aff F is a supporting affine subspace of C with (aff F) 0 C = F. 


(The equivalence of (a) and (b) throws some light upon the difference between faces 
and exposed faces.) 


5.5. Let C be a compact convex set in R%, and let M be a subset of ext C. Show that 
conv M isa face of C if and only if 


(aff M) m conv((ext C)\\M) = ©. 
5.6. Show that there are compact convex sets C such that 
C # conv(exp C). 
Prove Straszewicz’s Theorem: For any compact convex set C one has 
C = cleonv(exp C). 


(Warning: This is not easy.) 


$6. Polarity 


Duality plays an important role in convexity theory in general, and in poly- 
tope theory in particular. Actually, we shall be working with two duality 
concepts: a narrow one called polarity and a broader one which we shall 
simply refer to as duality. The notion of polarity applies to convex sets in 
general, whereas duality in the broader sense will only be applied to polytopes. 

This section deals with polarity. With each subset M of R%, we shall 
associate a certain closed convex subset M° of R*, called the polar of M. When 
C is acompact convex set having o in its interior, then the polar set C° has the 
same properties, and C is the polar of C°. For such a pair of mutually polar 
compact convex sets having o as interior point, the polar operation induces 
a one-to-one inclusion reversing correspondence between &(C) and &(C°). 

One should note that the notion of polarity is a linear concept, while in the 
preceding Sections 2-5 we worked within the framework of affine spaces. In 
particular, the polar operation is not translation invariant. 


For any subset M of R‘, the polar set is the subset M° of R¢ defined by 
M°:= {ye R1IV¥xeM:<x,y> < 1} 


= {ye R*|supyeu (x,y) < I}. 
Equivalently, 


M° = () K(x, 1). (1) 


xeM 
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Since y 1s in K(x, 1)ifand only if x is in K(y, 1), it follows from (1) that we have 
yeM’=+M c K\y, 1). (2) 


It also follows from (1) that M° is a closed convex set containing o, since each 
K(x, 1) is such a set. Furthermore, it is clear that 


M,< M,=>Mj;3 > M3. (3) 
We shall prove the following: 


Theorem 6.1. For any subset M of R¢ one has: 


(a) If M is bounded, then o is an interior point of M°. 
(b) Ifo is an interior point of M, then M° is bounded. 


Proor. For z € R* and r > 0 we denote by B(z, r) the closed ball centred at z 
with radius r, 1.e. 
B(z, r):= {x € R@| ||x — z|| < r}. 


Here ||-|| denotes the Euclidean norm, Le. 


|u| = V Cu, u». 
Now, it is an elementary standard fact that 


sup <x, y) = rllyl| 


xe B(o,r) 
for all ye R4 and r > O. This shows that 
Blo, ry’ = Bo, r~*). (4) 


Therefore, if M is bounded,1.e.M < Bio, r) for somer > O, then using (3) and 
(4) we see that B(o, r-') < M°, showing that o is an interior point of M°. 
This proves (a). Next, if o is an interior point of M,1.e. B(o, r) < M for some 
r > 0, then again using (3) and (4) we obtain M° c B(o, r~'), showing that 
M° is bounded. This proves (b). LJ 


The polar operation can, of course, be iterated. We write M°° instead of 
(M°)°. The set M°° is called the bipolar of M. It can be described as follows: 


Theorem 6.2. For any subset M of R? we have 
M°° = clconv({o} U M), 
i.e. M°° is the smallest closed convex set containing o and M. 
PROOF. We have 
M*=(\ KQ,1)= () KO,D, (5) 


y e M® Mc K(y, 1) 


cf. (1) and (2). This formula immediately implies that M°° is a closed convex 
set containing o and M, whence M°° contains clconv({o} U M). To prove 
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the opposite inclusion, let z be a point not in clconv({o} U M); we shall prove 
that there is a closed halfspace K(u, 1) containing M such that z ¢ K(u, 1), cf. 
(5). By Theorem 4.5 there is a supporting halfspace K(y, «) ofclconv({o} U M) 
such that z ¢ K(y, ~). We then have 


max{<x, y>|x eclconv({o} U M)} = « < <z, v>. 


Since o is in clconv({o} U M), we have a > 0. Therefore, there exists B > 0 
such that 
max{<x, y>|x eclconv({o} U M)} < B < <z, y>. (6) 


Taking u := B~ *y, we obtain from (6) 
max{<x, u»|x eclconv({o} U M)} < 1 < <z,u), 


implying M c K(u, 1) and z¢ K(u, 1), as desired. LI 
From Theorems 6.1 and 6.2 we immediately get: 


Corollary 6.3. Let C be a compact convex set in R* having o as an interior point. 
Then C° is also a compact convex set having o as an interior point. Furthermore, 
C=C. 


In the following, C is assumed to be a compact convex set in R* with 
o € int C. To emphasize the completely symmetric roles played by C and C*, 
as explained by Corollary 6.3, we denote C° by D. 

The assumption o € int C implies that every supporting hyperplane of C is 
a proper supporting hyperplane, and has the form H(y, 1) for a unique 
ye R*\{o}. We then have C c K(y, 1), and hence yeD. The following 
theorem gives more information about this situation: 


Theorem 6.4. For any y € R‘, the following two conditions are equivalent: 


(a) H(y, 1) is a supporting hyperplane of C. 
(b) yebd D. 


Similarly, for any x € R*, the following two conditions are equivalent: 


(c) H(x, 1) is a supporting hyperplane of D. 


(d) xebd C. 

PROOF. If (a) holds, then y € D and 
sup <x, y> = l. (7) 
xeC 


(Actually, the supremum is a maximum.) If we had ye int D, then we would 
also have Ay € D for acertain J > 1. Since D is the polar of C, we would then 
have 

sup <x, Ay> < 1, 


xeCc 


contradicting (7). Hence ye D\int D = bd D, as desired. 
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Conversely, if (b) holds, then, in particular, y is in D\{o}. Since D is the 
polar of C, we then have 


0 <sup<x, y> < 1. (8) 


xeCc 


Now, if in (8) we had <1, then we would have 


sup <x, Ay> = 1 

xeC 
for a suitable A > 1, whence Ay would be in C° (=D). Since 0 € int D and 
yeé jo, Ay[, this would imply yeint D by Theorem 3.3, a contradiction. 
Therefore, 

sup <x, y> = 1. 

xeC 
Finally, this supremum is actually a maximum by the compactness of C and 
the continuity of <-, y>. Hence, H(y, 1) is a supporting hyperplane of C, as 
desired. 

As explained earlier, C and D play completely symmetric roles. Therefore, 

the equivalence of (c) and (d) is a consequence of the equivalence of (a) and (b). 


L 


Corollary 6.5. For any x, y € R’, the following four conditions are equivalent: 
y 


(a) H(y, 1) is a supporting hyperplane of C at x. 
(b) H(x, 1) is a supporting hyperplane of D at y. 
(c) <x, y> = 1,xeEbd C, yebd D. 

(d) <x,y> = 1,xEC, yeD. 


PROOF. The equivalence (a)<>(c) follows immediately from Theorem 
6.4, (a) <> (b). The equivalence (b) <> (c) then follows by symmetry, or from 
Theorem 6.4, (c) <> (d). It is trivial that (c) = (d). We shall complete the proof 
by showing that (d) = (a). From ye D(=C?) it follows that C < K(y, 1), and 
from <x, y> = 1 it follows that x € H(y, 1). Since x € C, it then follows that 
A(y, 1) 1s a supporting hyperplane of C at x. LJ 


Now, for an exposed face F of C, proper or improper, we define 
F* := {ye D|Vx Ee F: <x, y> = 1}. 
Similarly, for an exposed face G of D we define 
G* = {xEeC|VyeG: <x, y> = 1}. 


The motivation for this concept is the fact that when F is a proper exposed 
face of C, then a point ye R’ is in F° if and only if H(y, 1) is a supporting 
hyperplane of C with F < H(y, 1); this follows immediately from Corollary 
6.5, (a) <> (d). The same holds for a proper exposed face G of D. For the 
improper exposed faces C and @ of C, we have C* = & and @* = D. And 
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for the improper exposed faces D and @ of D, we have D* = and O” = C. 
(The unpleasant feature that @” may have different “values” is, of course, 
due to the fact that we use the same notation for different mappings.) 


Theorem 6.6. Let F be a proper exposed face of C. Then F° is a proper exposed 
face of D. Similarly for a proper exposed face G of D. 


ProoF. By definition, 


F* = ()DO H(x, 1). 
xeF 
When F is proper, then each x € F is in bd C, whence H(x, 1) is a supporting 
hyperplane of D, cf. Theorem 6.4, (d) = (c). Therefore, each set D ~ H(x, 1) 
is a proper exposed face of D, implying that F° is an exposed face of D, 
cf. Theorem 5.9. Furthermore, F° is proper or empty. But since F is a proper 
exposed face, there is a supporting hyperplane H(y, 1) of C such that 
F = CO HV, 1). From the remark above following the definition of F°, 
we then see that ye F°, whence F° # @. L] 


By Theorem 6.6, it makes sense to iterate the A\-operation. Writing F°* 
instead of (F”)°, we see that for the improper exposed faces C and @ of C we 
have C°* = C and @** = @. Moreover, by Theorem 6.6, F°* is a proper 
exposed face of C when F is a proper exposed face of C. We actually have: 


Theorem 6.7. Let F be a proper exposed face of C. Then F°* = F. Similarly for 
a proper exposed face G of D. 


PROOF. By definition, 


F** = () Ca Hy, 1). 
ye F* 
But since y is in F° if and only if H(y, 1) is a supporting hyperplane of C with 
F c H(y, 1), we see that F°° is the intersection of all proper exposed faces of 
C containing F. This intersection, of course, is simply F itself. LJ 


For an exposed face F of C, we call the exposed face F° of D the conjugate 
face of F; the same applies to an exposed face G of D. Theorems 6.6 and 
6.7 show that the exposed faces of C and D go together in pairs F, G of 
mutually conjugate faces, both proper or both improper. 

It is clear that the A-operation reverses inclusions. The following is, 
therefore, a consequence of Theorems 6.6 and 6.7: 


Corollary 6.8. The mapping F +> F°, where F € &(C), is an anti-isomorphism 
from(&(C), <)onto(&(D), <), and the mapping Gt> G*, where Ge &(D), is an 
anti-isomorphism from (&(D), <) onto (&(C), <). The two mappings are 
mutually inverse. 
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Anti-isomorphisms reverse inf and sup. Therefore, Corollary 6.8 yields: 


Corollary 6.9. Let {F;|i¢ I} be a set of exposed faces of C, let Fy denote the 
largest exposed face of C contained in all the F,’s (i.e. F 9 is the intersection of the 
F's), and let F , denote the smallest exposed face of C containing all the F;'s. Then 
FG is the smallest exposed face of D containing all the F;’s, and F‘ is the largest 
exposed face of D contained in all the F;’s (i.e. F{ is the intersection of the 
F;’s). Similarly for a set of exposed faces of D. 


We remind the reader that for some time we have been working under the 
general assumption that C and D are mutually polar compact convex sets in 
R? having o as an interior point. This assumption is maintained in the follow- 
ing theorem. (Among other things, this explains the meaning of d in the 
formula.) 


Theorem 6.10. Let F and G be a pair of mutually conjugate faces of C and D, 
respectively. Then 


dim F + dimG <d — 1. 


Proor. The conjugate face of the improper exposed face @ of C is the im- 
proper exposed face D of D. Similarly, the conjugate face of the improper 
exposed face C of C is the improper exposed face @ of D. Sincedim @ = —1, 
dim C = danddim D = d, wesee that the formula holds when F is improper, 
in fact with equality. Consequently, we need only consider the case where F is 
a proper exposed face of C; then the conjugate face G of D is also proper, cf. 
Theorem 6.6. Now, by the definition of the A\-operation, 


G=Do ()\ AG, )). 


xeF 
Therefore, G is a subset of the affine subspace ( ),.. - H(x, 1), whence 


dim G < dim () H(x, 1). (9) 


xeF 


By (9), the affine subspace (),.- H(x, 1) is non-empty; therefore it is a 
translate of the linear subspace ( ),.-- H(x, 0), and so 


dim ()\ H(x, 1) = dim ()) H(x, 0). (10) 


xeF xeF 


But 
()\ H(x, 0) = {ye R*|Vx EF: <x, y> = 0} 


xeF 


= Ft = (span F)"*. 


$6. Polarity 43 


Therefore, 
dim () H(x, 0) = dim((span F)") 


xeF 

= d — dim(span F) 

= d — (dim(aff F) + 1) 

= d — 1 — dim(aff F) 

=d—1-—dimF, (11) 
where we have used the fact that o ¢ aff F to obtain 

dim(span F) = dim(aff F) + 1. 

Combining now (9), (10) and (11), we obtain the desired formula. L] 


EXERCISES 
6.1. Show that (AM)° = A~'M° when A # 0. 
6.2. Show that (M°°)° = M°. 
6.3. Show that ((J;.;Mj° = (\ier Mi. 
6.4. Show that 
fi ci) = cleonv |) C? 
iel iel 
when the sets C; are closed convex sets containing o. 


6.5. For e < d, identify R® with the subspace of R* consisting of all (x,,..., x4) € R’ 
such that x,,, =+-: = Xq = 0. Let I denote the orthogonal projection of R* onto 
R°. Show that for any subset M of R4 we have 
(MM)? = Moa RS, 
where II(M)° denotes the polar of T1(M) in R° and M° denotes the polar of M in 
R4. 
6.6. Let C and D be mutually polar compact convex sets. Let F be a proper exposed 
face of C, and let G:= F°. Show that 
G=Dnon ()\ H@,}), 
xeext F 


and show that 
G=Do H(xo, 1) 
for any relative interior point xq of F. 


6.7. Let C and D be mutually polar compact convex sets. Extend the definition of the 
Z\-operation by allowing it to operate on arbitrary subsets of C and D. Show that 
when M is a subset of C, then M°*:=(M*)° is the smallest exposed face of C 
containing M. 


CHAPTER 2 
Convex Polytopes 


§7. Polytopes 


A (convex) polytope is a set which is the convex hull of a non-empty finite set, 
see Section 2. We already know that polytopes are compact. We may, 
therefore, apply Section 5 on the facial structure of closed convex sets to 
polytopes. As one might expect, the facial structure of polytopes is consider- 
ably simpler than that of convex sets in general. 


A polytope P = conv{x,, ..., x,} is called a k-polytope if dim P = k. 
This means that some (k + 1)-subfamily of(x,,...,x,) is affinely independent, 
but no such (k + 2)-subfamily is affinely independent. By a k-simplex we mean 
a k-polytope which is a simplex. A simplex is a k-simplex if and only if it has 
k + 1 vertices, cf. Section 2. A 1-simplex is a closed segment. A 2-simplex is 
called a triangle, a 3-simplex is called a tetrahedron. 

We have the following description of polytopes in terms of extreme points: 


Theorem 7.1. Let P be anon-empty subset of R*. Then the following two condi- 
tions are equivalent: 


(a) P is a polytope. 
(b) P is acompact convex set with a finite number of extreme points. 


PROOF. When P is a polytope, say P = conv{x,,..., X,}, then P is compact 
by Corollary 2.9. Next, Theorem 5.10, (a) = (b) shows that ext P is a subset of 
{X1,.--,X,}, and hence is a finite set. The converse follows immediately from 
Theorem 5.10, (b) = (a). LJ 
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Following common usage, we shall henceforth call the extreme points, Le. 
the 0-faces, of a polytope P the vertices of P. We shall continue to denote the 
set of vertices of P by ext P. The 1-faces are called the edges of P. 

The vertices of a simplex S in the sense used in Section 2 are, in fact, the 
extreme points (i.e. vertices) of S. This follows immediately from Theorem 
5.10 or Theorem 7.2 below. 

The set {x,,..., x,} Spanning a polytope P = conv{x,, ..., x,} 1s of 
course not unique (except when P is a 1-point set); in fact, one may always 
add new points x,,,,... already in P. However, there is a unique minimal 
spanning set, namely, the set ext P of vertices of P: 


Theorem 7.2. Let P be a polytope in R‘, and let {x,,..., X,} be a finite subset 
of P. Then the following two conditions are equivalent: 


(a) P = conv{x,,..., X,}. 
(b) ext Pc {x,,..., X,}. 


In particular, 

(c) P = conv(ext P). 

Proor. Noting that polytopes are compact, the statement follows immedi- 
ately from Theorem 5.10. L 


We shall next study the facial structure of polytopes in general. 


Theorem 7.3. Let P be a polytope in R‘, and let F be a proper face of P. Then F 
is also a polytope, and ext F = F next P. 


PROOF. We begin by noting that P and F are compact, cf. Theorem 7.1, (a)=(b) 
and Theorem 5.1. Now, Theorem 5.2 shows that the extreme points of F are 
just those extreme points (vertices) of P which are in F,1.e.ext F = F next P. 
Since ext P is a finite set by Theorem 7.1, (a) = (b), it follows that ext F isa 
finite set. Application of Theorem 7.1, (b) = (a) completes the proof. LJ 


Corollary 7.4. Let P be a polytope in R*. Then the number of faces of P is 
finite. 


PROOF. The number of extreme points of P is finite by Theorem 7.2, (a) = (b). 
Each face of P is the convex hull of extreme points of P by Theorem 7.3 and 
Theorem 7.2(c). Therefore, the number of faces is finite. LJ 


The following is a main result: 


Theorem 7.5. Let P be a polytope in R*. Then every face of P is an exposed face. 


ProoF. It suffices to prove the statement for d-polytopes in R*. We shall use 
induction on d. For d = 0 there is nothing to prove, for d = 1 the statement 
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is trivial, and for d = 2 it is obvious. Suppose that the statement is valid for 
all polytopes of dimension <d, where d > 3, and let P be a d-polytope in 
R*. For improper faces of P there is nothing to prove, so let F be a proper face 
of P. Let x be a relative interior point of F, cf. Theorem 3.1, and let H be a 
proper supporting hyperplane of P at x, cf. Theorem 4.3. Then Hm P is a 
proper exposed face of P containing x. Using Theorem 5.6, we see that 
FcHoP. If F = Ho P, then F is exposed, as desired. If F | HAP, 
then F is a proper face of HA P, cf. Theorem 5.2. (See Figure 3.) Since 


Figure 3 
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dim(H 7 P) < d, and Hc P is a polytope, cf. Theorem 7.3, it follows from 
the induction hypothesis that there is a proper supporting hyperplane H’ 
of H © P in aff(H 2 P) such that F = H’ 0 (AH - P). This hyperplane H’ we 
may extend to a hyperplane A in H such that 


F=AQ0P. (1) 


Note that dim A = d — 2 > 1. Let B be a 2-dimensional affine subspace of 
R4 which is orthogonal to A, and let x denote the orthogonal projection of 
R¢ onto B. Then z(A) is a 1-point set. Furthermore, z(P) is a 2-polytope in B. 
We claim that (A) is a vertex of x(P). If not, then there are points y and z 
in P such that z(y) # z(z) and 


mA) = (1 — A)n(y) + An(z) 
for some Ae ]0, 1[. Let 
u:=(1 — A)y + Az. 


Then wu is in P, and x(u) = x(A), whence u is in A since 2 ‘(x(A)) = A. 
Therefore, u is in F, cf. (1). Since F is a face of P, it follows that y and z are 
in F. But F is a subset of A, whence z(v) = x(A) for all ve F. In particular, 
m(y) = 2(z), a contradiction which proves that 2(A) is a vertex of x(P). By 
the 2-dimensional version of the theorem we then see that there is a line 
L in B such that 


LO 2P) = n(A). 
Then 
H, :=aff(A UL) = 27 '(L) 


is a supporting hyperplane of P in R* with H, - P = F, as desired. LJ 


Corollary 7.6. Let P be a polytope in R‘. Then the two lattices (F(P), <) and 
(€(P), <) are the same. 


We shall finally introduce two particular classes of polytopes, the pyramids 
and the bipyramids, and we shall describe their facial structure. 
A pyramid in R* is a polytope—cf. Theorem 7.7(a)—of the form 


P = conv(Q U {Xo}), 


where Q is a polytope in R%, called the basis of P, and x, is a point of R*\ aff Q, 
called the apex of P. (Note that basis and apex need not be unique: a simplex 
is a pyramid where any facet may be taken as the basis, or, equivalently, 
any vertex may be taken as the apex.) A pyramid P is called an e-pyramid 
if dim P = e. Clearly, a pyramid P is an e-pyramid if and only if its basis Q 
is an (e — 1)-polytope. 
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The facial structure of a pyramid is determined by the facial structure of 
its basis as follows: 


Theorem 7.7. Let P be a pyramid in R¢ with basis Q and apex X,. Then the 
following holds: 


(a) P is a polytope with ext P = (ext Q) U {Xo}. 

(b) A subset F of P with xo ¢ F is a face of P if and only if it is a face of Q. 

(c) A subset F of P with x) &€F is a face of P if and only if there is a face G 
of Q such that F = conv(G vu {xo}), i.e. F = {xo} or F is a pyramid with a 
face G of Q as the basis and Xo as the apex. For each such face F of P, the 
face G is unique, and dim G = dim F — 1. 


PROOF. (a) The set 
P, := conv((ext Q) U {xo}) 


is a convex set containing Q and xo, cf. Theorem 7.2(c). Therefore, it contains 
P. On the other hand, it is clear that P, < P, whence 


P = conv((ext Q) U {Xxo}). 
This shows that P is a polytope and also implies that 
ext P c (ext Q) U {xo}, 


cf. Theorem 7.2, (a) = (b). To prove the opposite inclusion, we first remark 
that P is the union of all segments [y, x,.], where y € Q. It is then clear that if 
H, isa hyperplane with x) € Hj) and Hy, aff QO = @, then H, is a supporting 
hyperplane of P with Hy \ P = {xo}, implying that x, € ext P. To prove also 
that every x € ext Q is in ext P, we prove more generally that every proper 
face of QO is a face of P. Let F be a proper face of Q. Then there is a supporting 
hyperplane H of Q in aff QO such that H A Q = F. Let H, be a hyperplane in 
R? such that H, 0 aff Q = H and x, is on the same side of H, as Q\F. Then, 
again using the remark above that each point of P belongs to some segment 
[y, X)] with ye Q, we see that H, 7 P = F, whence F is a face of P. This 
completes the proof of (a). (A more direct way of showing that every (proper) 
face of Q is a face of P goes via the observation that Q is a facet of P. Our 
motivation for preferring the proof given above is the fact that after an 
obvious modification it also applies to the situation needed in the proof of 
Theorem 7.8 below.) 

(b) During the proof of (a) it was proved that every proper face of Q is a 
face of P. Since Q itself is also a face (in fact, a facet) of P, it follows that every 
face of Q is a face of P. 

Conversely, let F be a non-empty face of P not containing x9). By Theorem 
7.5 there is a supporting hyperplane H of P such that HP = F. Using 
(a) and Theorem 7.3 we see that ext F < ext Q, whence F <c Q. But then 
trivially F is a face of Q. 
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(c) We first prove that every set F of the form 
F = conv(G vu {Xo}), 


where G is a face of Q, is a face of P. We need only consider the case where G 
is a proper face of Q. For any such face G there is a supporting hyperplane H 
of Q in aff Q such that Hm Q = G. Let H, be a hyperplane in R® such that 
H, aff Q = H and x, €H,. Then clearly H, is a proper supporting hyper- 
plane of P, whence 


F, = A, CY P 
is a proper (exposed) face of P. Moreover, 


ext F, = H, next P 
= (H next Q)uU {xo} 
= (ext G) U {Xo}, 


where we have used Theorem 7.3 and (a). Then using Theorem 7.2(c) we get 
F, = conv((ext G) U {xo}) 


conv(G U {Xxo}) 
= F, 


whence F is a face of P, as desired. 

To prove the converse, we need only consider the case where F # {xo} 
and F # P. Let H be a supporting hyperplane of P such that F = Ho« P, 
cf. Theorem 7.5. Since P is the union of all segments [y, x) ], where y € Q, we 
see that F is the union of all segments [y, x) ], where ye H 1 Q. Letting 
G := Hc Q, it follows that 


F = conv(G vu {xo}), 
and it is clear that G is a face of Q. 
Finally, the uniqueness of G and the dimension formula are obvious. (J 
A bipyramid in R* is a polytope—cf. Theorem 7.8(a)—of the form 
P = conv(Q U {Xo, X1}); 


where Q is a polytope in R* with dim Q > 1, and xo, x, are two points of 
R*\ aff Q such that 


]x9,X1L 010 # SG. 


(Then actually ]x,, x,[ has precisely one point in common with ri Q.) The 
set Q is called the basis of P, and x9, x, are called the apices of P. (As in the 
case of pyramids, basis and apices are, in general, not unique.) A bipyramid 
P is called an e-bipyramid if dim P = e. Clearly, a bipyramid P is an e- 
bipyramid if and only if its basis Q is an (e — 1)-polytope. 
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The facial structure of a bipyramid 1s determined by the facial structure 
of its basis as follows: 


Theorem 7.8. Let P be a bipyramid in R¢ with basis Q and apices Xo and x). 
Then the following holds: 


(a) Pisa polytope with ext P = (ext Q) U {Xo, X;}. 

(b) A subset F of P with x), x, ¢F is a face of P if and only if it is a face of Q 
with F # Q. 

(c) A subset F of P with x) € F and x, € F is a face of P if and only if there is a 
face G of Q with G # QO such that F = conv(G vu {xo}), i.e. F = {Xo} or F 
is a pyramid with a face G of Q with G # Q as the basis and Xo as the 
apex. For each such face F of P, the face G is unique, and dim G = 
dim F — 1. Similarly for subsets F of P with x, €F and x) ¢ F. 

(d) A subset F of P withx,),x, €F is a face of P if and only if F = P. 


Proor. The proof follows the same lines as the proof of Theorem 7.7. The 
details are left to the reader. LJ 


EXERCISES 


7.1. Show that every polytope P with n vertices is the orthogonal projection of an 
(n — 1)-simplex. (This is to be understood as follows: “Embed” P in R"~!; con- 
struct an (n — 1)-simplex in R"”~’ whose orthogonal projection onto aff P is P.) 


7.2. Let 1 <n < d. Starting with a (d — n)-polytope Q in R*, we may successively build 
up pyramids P,, P,,..., P, by taking P, to be a (d — n + 1)-pyramid with Q asa 
basis, taking P, to be a(d — n + 2)-pyramid with P, as a basis, etc. The d-pyramid 
P,, is then called an n-fold d-pyramid with Q as a basis. Show that a (d — 1)-fold 
d-pyramid is also a d-fold d-pyramid; it is, in fact, a d-simplex. 


7.3. Copying Exercise 7.2, define the notion of a n-fold d-bipyramid. Show that a (d — 1)- 
fold d-bipyramid is also a d-fold d-bipyramid. 

(A particular type of d-fold d-bipyramids are the d-crosspolytopes; these are the 

convex hulls of 2d points a,,...,a,, b,;,..., b, such that all segments [a,, b;] have 

a common midpoint, and no [a,, b;] is contained in the affine hull of [a,, 5, ],..., 

[a;_,, b;- ,]. If the segments [a;, b;] are orthogonal and have the same length, then 

the d-crosspolytope is said to be regular. A 3-crosspolytope is called an octahedron.) 


7.4. A prism in R¢ is a polytope of the form 
P = conv(Q u (a + Q)), 


where Q is a polytope in R* with dim Q < d, and a + Q ¢ aff Q. Show that this 
definition is equivalent to the following: A prism in R¢ is a polytope of the form 


P=Q + [o, a], 


where Q is a polytope in R* with dim Q < d and a is a point in R*\ {0} such that 
the line through o and a is not parallel to aff Q. 


Show that 
ext P = ext Q U ext(a + Q). 
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Show that the faces of P are the faces of Q, the faces of a + Q. and the prisms 
F = conv(G vu (a + G)), 


where G is a face of Q. 


7.5. Copying Exercise 7.2, define the notion of an n-fold d-prism. Show that a(d — 1)-fold 
d-prism is a d-fold d-prism. 
(A d-fold d-prism is also called a d-parallellotope; equivalently, a d-parallellotope 
is a polytope of the form 
at Lo, b, ] toe Lo, ba], 


where 5b; is not contained in the affine hull of a + [o, b,] + --- + [o, b;_ ,]. If the 
segments [o, b;] are orthogonal and have the same length, then the d-parallellotope 
is called a d-cube.) 


§8. Polyhedral Sets 


In previous sections we have proved that every compact convex set C has an 
“external representation ” as the intersection of closed halfspaces, namely, the 
supporting halfspaces, and an “internal representation” as the convex hull 
of a (unique) minimal set, namely, the set of extreme points. (Actually, for the 
external representation compactness is not needed, closedness suffices.) The 
sets which have a “finite” internal representation are the polytopes. In this 
section we shall study the sets which have a “finite” external representation, 
i.e. sets which are intersections of a finite number of closed halfspaces. These 
sets are called polyhedral sets. The main basic fact in polytope theory 1s that 
the polytopes are precisely the non-empty bounded polyhedral sets. Part of 
this statement will be proved at the end of this section; the remaining part 
will be proved in the next section. 


A subset Q of R® is called a polyhedral set if Q is the intersection of a 
finite number of closed halfspaces or Q = RY‘. 

Every hyperplane H in R’‘ is the intersection of the two closed halfspaces 
which are bounded by H, and every affine subspace A of R4 with A # R¢ is 
the intersection of a finite number of hyperplanes. Therefore, every affine 
subspace of R? is polyhedral. 

Let Q be a polyhedral set in R*, and let A be an affine subspace of R? 
such that 0 < A # R“% Then Q is the intersection of a finite number of closed 
halfspaces in A or Q = A. This follows from the fact that if K is a closed 
halfspace in R* such that A A K # @, then A 7 K is a closed halfspace in A 
or AN K =A. 

Conversely, let A be an affine subspace of R4 with A # R‘, and let O bea 
subset of A such that Q is the intersection of a finite number of closed half- 
spaces in A or Q = A. Then Q is the intersection of a finite number of closed 
halfspaces in R* and hence polyhedral. This follows from the fact that for 


52 2. Convex Polytopes 


every closed halfspace K in A there is a closed halfspace K’ in R¢ such that 
Aa K' = K. 

Every polyhedral set 1s closed and convex. The intersection of a finite 
number of polyhedral sets is again polyhedral. Any translate of a polyhedral 
set is again polyhedral. The image of a polyhedral set under an affine mapping 
is again polyhedral. 

The facial structure of a (non-empty) polyhedral set Q in R’ is trivial when 
Q is an affine subspace of R’, the only faces being @ and Q. When Q is an 
e-dimensional polyhedral set in R* which is not an affine subspace, then Q 
is affinely isomorphic to a polyhedral set Q’ in R° with dim Q’ = e and 
O’ # R°. Therefore, when studying facial properties of polyhedral sets, it 
suffices to consider polyhedral sets Q in R* with dim Q = d and Q ¥ R‘. 

Every polyhedral set Q in R% has a representation 


QO = (K(x, 0) (1) 


In the following, when talking about a representation (1) of Q, we shall always 
implicitly assume that no two K(x,, 4;)’s are identical. For O # R* we may 
always assume that each K(x;, a;) is a closed halfspace, 1.e. each x; is #0. For 
QO = R‘ there is only one representation, namely, Q = K(o, «), where « > 0. 
Note that when Q # R® there are infinitely many representations (unless 
d = 0); new closed halfspaces containing Q may always be added. 

We shall call a representation (1) irreducible ifn = 1, orn > 1 and 


0 ¢ () K(xi, %), j=Hl,...,n. 
iF] 
A representation which is not irreducible is called reducible. Clearly, any 
reducible representation may be turned into an irreducible representation 
by omitting certain of the sets K(x;, «;). It follows from Theorem 8.2 below 
that there is only one irreducible representation of each polyhedral set Q 
which is not an affine subspace. 


Theorem 8.1. Let Q be a polyhedral set in R*? with dim QO = d and Q # R*. 
Let 


Q = [\ Kou. a;) 


be a representation of Q with n > 1, where each K(x;, 4;) is a closed halfspace. 
Then the representation is irreducible if and only if 


A(x, %;) O int () K(x;, %) # OW 
i=1 
L# j 


for eachj = 1,...,n. 
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ProoF. For j = 1,..., n, we let 


M,; = () K(x;, O;). 
iF) 
Then Q = K(x,, «;) 0 M, for each j, and since dim Q = d by assumption, 
we see that int M; 4 @; consequently, r1 M, = int M, and M,; ¢ H(x,, «,). 
The condition of the theorem reads 


A(x;,0)OmtM,;4A 0, jH=l,...,n. (2) 
By Theorem 4.1 and the observations above, (2) is equivalent to 
M,;¢ K(x;, 0 i), M,; 4 K(-x;, — a;), j= 1,...,7. (3) 


Now, M,; < K(—x,;, —a,;) would imply 
Qc K(x,, 4) 0 K(—xX;, —4,;) = H(x;, %)), 
a contradiction. Hence, (3) is equivalent to 
M, F K(x;, %;), j=l,...,n. (4) 
But (4) is just a rephrasing of irreducibility. LJ 


The following theorem shows that the boundary of a polyhedral set is 
built up in the expected way: 


Theorem 8.2. Let Q be a polyhedral set in R¢ with dim Q = d and Q # R*. Let 
Q = () K(x;, 4) (*) 
i=1 


be a representation of Q, where each K(x;, «;) is a closed halfspace. Then the 
following holds: 


(a) bd Q = (Ja, HX, %) OQ. 

(b) Each facet of Q is of the form H(x;, «;) 0 Q. 

(c) Each set H(x;,a;) 0 Q is a facet of Q if and only if the representation (*) is 
irreducible. 


PROOF. (a) We have 


int Q = int () K(x;, «;) 
i=1 


n 


int K(x,, a;) 
1 


il 


i 


I 
3 


K(x; ’ 0;)\ A(x; ’ ct) 


~« 
Il 


i=1 


which implies (a). 
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(b) Let F be a facet of Q. Let x be a relative interior point of F. Then F 
is the smallest face of Q containing x, cf. Theorem 5.6. By (a), there 1s j such 
that 

x € H(x;, 4) 0 Q. 


But then we must have 


Fc H(x;, 4) AQ, 


implying 
F = H(x;, 4) Q, 
cf. Corollary 5.5. 
(c) For n = 1 there is nothing to prove. So assume that n > 1. 
If (*) is irreducible, then each H(x;, «;) supports Q, whence H(x,, «;) 0 Q 
is a proper face of Q. We prove that H(x;, %;) 1 Q has a non-empty interior 
in H(x,, «,); this will imply that H(x;, % )) 0 Q is a facet. We have 


H(x;, 0) VQ= H(x;, at ;) ™ () K(x;, &;) 
i=1 


= H(x;, 4) (\ K(x;, %) 
aj 
> H(x,;, «;) 0 int () K(x;, %) 
aj 
# OD, 


cf. Theorem 8.1. Since the set 


H(x,;, a) 0 int (\ K(x;, %) 
Fi 
is open in H(x,, «;), the desired conclusion follows. 
Conversely, if (*) is reducible, then 


Q = {) K(x, at) 
Fj 
for some j. Suppose that H(x;, «;) 0 Q is a facet of Q. Let x be a relative 
interior point of H(x,, «;) 0 Q. Using (a) we see that there is ani with i 4 J 
and x € H(x,, #;) © Q. But then we must have 


A(x,, 0 ;) OO = A(x;, 4%) OQ, 
cf. Corollary 5.5. This, however, implies 
K(x;, at ;) = K(x;, %), 


a contradiction. Hence, H(x;, «;) 0 Q is not a facet of Q. CL] 
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The preceding theorem shows that most polyhedral sets have facets, the 
only exceptions being affine subspaces. 


Theorem 8.3. Let F be a proper face of a polyhedral set QO in R*. Then there is a 
facet G of Q containing F. 


PRroor. We may assume that dim Q = d. Choose an irreducible representa- 
tion 


Q= (\ Kou, ;). 


Let x be a relative interior point of F. By Theorem 8.2(a), there is j such that 


x € A(x;, &) 0 Q. 


Jj? 
Now, F is the smallest face containing x, cf. Theorem 5.6, and H(x,;, «;) 1 Q 
is a facet containing x, cf. Theorem 8.2(c). Therefore, with 
G:= H(x;,a;))VQ 


we have the desired conclusion. LJ 


Corollary 8.4. Let Q be a polyhedral set in R*. Then every face of Q is also a 
polyhedral set. 


PROOF. We need only prove the statement for proper faces of @. Theorem 8.3 
shows that any proper face of Q is a face of a facet of Q. Facets of Q, however, 
are polyhedral sets by Theorem 8.2(b). The statement then follows by induc- 
tion on the dimension. L 


Corollary 8.5. Let Q be a polyhedral set in R*. Then the number of faces of Q 
is finite. 


PROOF. The number of facets of a polyhedral set Q is finite, cf. Theorem 8.2(b). 
Each proper face of Q is a face of a facet of OQ by Theorem 8.3. The statement 
then follows by induction on the dimension. LJ 


Corollary 8.6. Let Q be a polyhedral set in R* with dim Q = d. Let F, and F, 


be faces of Q with 
F,c F, 
and 
dim F,; = j, dim F, = k, 
where 


O<j<j+il<k-1<k<d. 


56 2. Convex Polytopes 


Then there are faces F ,41,..., Fy, of Q with 


Fi, oF, Co C Fy, © F, 
and 
dim F,; = i, i=jt+tl,...,k-—1. 


PRooF. By Theorem 5.2, F, is a proper face of F,. And by Corollary 8.4, F, 
is polyhedral. Theorem 8.3 then ensures the existence of a facet F,_, of F, 
with F, c F,_,. Ifj = k — 2, we have the desired conclusion. If j < k — 2, 
we argue as above with F,_, replacing F,. Continuing this way, we obtain 
faces F, with the desired properties. (J 


In Corollary 8.6, note that we actually have 


F,S Fiji, So GPFy-1 SE Fy. 


Note also that the statement is not valid in general with j = —1. 
We conclude this section with the following: 


Corollary 8.7. Let QO be a non-empty bounded polyhedral set in R*. Then Q is a 
polytope. 


PrRooF. By assumption, Q is a compact convex set. By Corollary 8.5, ext Q 
is a finite set. The statement then follows from Theorem 7.1, (b) = (a). 


The converse of Corollary 8.7 is also valid, see Section 9. 


EXERCISES 


8.1. Show that the image of a polyhedral set under an affine mapping is again a poly- 
hedral set. 


8.2. Show that every face of a non-empty polyhedral set is exposed. 


8.3. Show that every non-empty polyhedral set not containing any line has at least one 
vertex. (Here, of course, a vertex of a polyhedral set means a 0-dimensional face, 
exposed by Exercise 8.2.) 


§9. Polarity of Polytopes and Polyhedral Sets 


In this section we shall apply the polarity theory of Section 6 to polytopes and 
polyhedral sets. We shall show that the polar of a polytope with o as an 
interior point is a bounded polyhedral set with o as an interior point, and 
conversely. As promised in Section 8, we shall deduce that every polytope is 
a bounded polyhedral set (whence polytopes can also be described as non- 
empty bounded polyhedral sets). Furthermore, we shall improve a result 
of Section 6 by showing that dim F + dim G = d — 1 when F and G are 
conjugate faces of mutually polar d-polytopes. 
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The following theorem explains in detail the polarity of convex polytopes 
and polyhedral sets. Note that the polyhedral sets Q having a representation 
of the particular form 


O= q K(x;, 1) 
are precisely the polyhedral sets which have o as an interior point. 
Theorem 9.1. Let x,,...,X,, where n > 1, be distinct points of R‘, and let 
P := conv{x,,..., Xn}; 
0:= al K(x;, 1). 


Then we have: 


(a) P?=@. 

(b) O° = conv{o, X,,..., Xp}. 

(c) P and Q are mutually polar sets if and only if o€ P. 

(d) P and Q are mutually polar sets with Q bounded if and only if o € int P. 


(e) Suppose that P and Q are mutually polar sets with Q bounded (i.e. o € int P, 
cf. (d)). Then we have 


ext P = {x,,...,X,} 


if and only if the representation 


Q = () K(x;, 1) 
i=1 
is irreducible. 


PROOF. (a) Formula (1) of Section 6 shows that 


{X1,..+5X_h° a Q, (1) 
and formula (2) of Section 6 shows that 
{Xy,--+,X,_f}° = (conv{x,,..., X,})° (2) 


since M c K(y, 1) if and only if conv M c K(y, 1). Combining (1) and (2) 
we obtain (a). 
(b) Using (a), Theorem 6.2 and Corollary 2.9 we have 


O° _ Pee 
= clconv{o, x;,...,X,} 
= conv{o, X1,...,X,} 


which proves (b). 
(c) This is an immediate consequence of (a) and (b). 
(d) This follows from (c) and Theorem 6.1. 
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(ec) By assumption we must have n > 2. Forj = 1,..., n, let 


P; ra conv{x,, sey Xj—15 Xj+ds cy Xn}s 


QO; i= () K(x;, 1). 
ei 


Note that application of (a) to {x4,...,Xj-1, Xj+is-++> Xn} instead of 
{x1,..+,X,} gives 


Po = Q,. (3) 

Furthermore, Theorem 7.2, (a) => (b) shows that we always have 
ext P< {x ,..., X,}- (4) 
Now, if ext P is a proper subset of {x,,..., x,}, then P = P,; for some j by 


Theorem 7.2(c). Then also P° = PS, whence P° = Q, by (3). But P° = Q 
by (a), and therefore we have Q = Q,. This shows that the representation of 
QO is reducible. 

Conversely, if the representation of Q is reducible, then Q = Q, for some j, 


and so also Q° = Q%. Application of (b) to {x1,...,Xj—1, Xj+as +++» Xnb 
instead of {x,,..., X,} gives 
OS = CONV{O, X4, 6-5 Xj—-as Xje aye e +s Xn 


Since Q° = P by assumption, and Q° = Q%, as we just have seen, it follows 
that 


P = conv{o, X4,.--, Xj-4, Xjeis +++ Nuh 


Now, Theorem 7.2, (a) <> (b) shows that here any non-extreme point of P 


among the points 0, x;,..., Xj-1, Xj+1, +--+» X, May be omitted. It follows 


from (4) and the assumption that o is such a point. Therefore, 
P = conv{xy,...,Xj—4, Xjtis +++ Xnf- 
Theorem 7.2, (a) = (b) then shows that 
ext P< {Xq, 0.6, Xjias Xjpas ++ Mah 


whence ext P is a proper subset of {x,,..., Xp}. LJ 


We are now ready to prove the following main theorem: 


Theorem 9.2. A non-empty subset P of R‘ is a polytope if and only if it is a 
bounded polyhedral set. 


Proor. We have already proved the “if” statement in Corollary 8.7. Con- 
versely, let P be a polytope in R’, say 


P = conv{x,,...,Xpy}- 
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To prove that P is a bounded polyhedral set it causes no loss of generality 
to assume that oeint P. Theorem 9.1(a) shows that P° = Q, where Q 
denotes the polyhedral set defined by 


Q= () KO, }). 


It follows from Theorem 9.1(d) that Q is bounded and that Q° = P. Applying 
now Corollary 8.7 to Q, it follows that Q is a polytope, say 


O =conv{y,,..., Vmt- 


We next apply Theorem 9.1 to {y,,..., y,,} instead of {x,,...,x,}. Statement 
(a) shows that Q° = R, where R denotes the polyhedral set defined by 


R = (| K(y;, 1). 
jJ=1 


But we have already seen that 0° = P, whence 


j=l 
i.e. P is a polyhedral set. LJ 


We may now use the results of Section 8 on polyhedral sets to obtain 
results on polytopes. 


Corollary 9.3. Let P,; and P, be polytopes in R4 such that P, A P, ¥# ©. 
Then P, Q P, is also a polytope. 


PROOF. The intersection of any two polyhedral sets in R? is polyhedral. The 
statement then follows from Theorem 9.2. LJ 


Corollary 9.4. Let P be a polytope in R‘, and let A be an affine subspace of R® 
such that PA A # &. Then Pc A is also a polytope. 


PRooF. Any affine subspace A of R* is polyhedral. The statement then follows 
as in the proof of Corollary 9.3. LI 


Corollary 9.5. Let P be a d-polytope in R*. Then P has at least d + 1 facets. 
PROOF. Let 

P = conv{x,,...,X,}; 
and assume without loss of generality that o € int P. Let 


Q:= (\ K(x;, 1). 


i=] 
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Then by Theorem 9.1(d), P and Q are mutually polar sets with Q bounded 
(and o € int Q). Corollary 8.7 next shows that Q is a d-polytope. Let y,;,..., Vn 
be the vertices of Q, and let 


R = () K(y;, 1). 
j=l 


Then Q and R are mutually polar by Theorem 9.1(d), whence R = P. More- 
over, Theorem 9.1(e) shows that the representation 


j=1 


is irreducible. But then the number of facets of P is m by Theorem 8.2(b), (c). 
On the other hand, the number of vertices of the d-polytope Q is at least 
d+ 1, whence m > d + 1, as desired. C] 


In the next corollary, note that when F is a facet of a d-polytope in R’, 
then aff F is a supporting hyperplane of P. 


Corollary 9.6. Let P be ad-polytope in R’, let F,,..., F,, be the facets of P, and 
let K(x;, ;) be the supporting halfspace of P bounded by aff F; fori = 1,...,n. 
Then 


P= (| K(x;, Ati), 


i=1 


and this representation is irreducible. 


PROOF. By Theorem 9.2, P is polyhedral. Let 
P= ( K(y;, B;) 
j=l 


be an irreducible representation of P. By Theorem 8.2(b), (c), the facets of P 
are the sets H(y,, B;) © P. But the facets of P are also the sets H(x;, «;) A P by 
assumption. Therefore, m =n and there is a one-to-one correspondence 
between the 7’s and ;’s such that 


A(x;, %;) = A(y;, B;) 
for corresponding i and j. Then of course also 
K(x;, %;) = K(y;, B;) 


for corresponding i and j. This shows that 
P = () K(;, O;)s 
i=1 


and that this representation is irreducible. LI 
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Corollary 9.7. Let P be a d-polytope in R*. Let F , and F,, be faces of P with 
F,c F, 
and 
dim F; = j, dim F, = k, 
where 
—-l<j<jti<k-1l<k<d. 
Then there are faces F ;,,,..., Fy—, of P with 
F,CO Fi, C++ CFy_y CF, 
and 
dim F; = i, i=j+1,...,k-—1. 


PRrooF. With Theorem 9.2 in mind, the statement follows immediately from 
Corollary 8.6 when j > 0. For j = —1, let Fo be any vertex of F,. If k = 1, 
we have the desired conclusion. If k > 2, apply Corollary 8.6 to the faces 
Fy and F,. a 


We shall finally improve the inequality of Theorem 6.10. Note that when 
P is a d-polytope in R¢ with o € int P, then P and P° form a pair of mutually 
polar d-polytopes, and each pair of mutually polar d-polytopes arises in this 
way; this follows from Theorem 9.1(a), (d) and Theorem 9.2. We also remind 
the reader that any face F of P is a member of a pair F, G of conjugate faces 
since all faces of P are exposed, cf. Theorem 7.5. 


Theorem 9.8. Let P and Q be mutually polar d-polytopes in R¢ and let F and G 
be conjugate faces of P and Q, respectively. Then 


dim F + dim G = d — 1. 


In particular, vertices of P are conjugate to facets of Q, and facets of P are 
conjugate to vertices of Q. 


PRrooF. We shall appeal to the proof of Theorem 6.10. As explained there, we 


need only consider the case where F and G are proper faces. Let x,,..., x, 
be the vertices of P, and let x,,..., x, be the vertices of F. Then by Theorem 
9.1(a), 
Q = (| K(x;, 1). (3) 
i=1 


Moreover, by the definition of the A-operation, we have 


G =00 () H(, 1). (6) 


xeF 
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Now, note that 


k 
() H(x, 1) = () H(x;, 1). (7) 
xe F i=] 
In fact, if yisin H(x;, 1) fori = 1,...,k, then x,,...,x,arein H(y, 1), whence 
every x € F must be in H(y, 1), showing that y is in H(x, 1) for every x € F. 
Combining now (5), (6) and (7) we obtain 


G= a K(x;, 1) 0 () HG, 1). (8) 


i=k+1 i=1 


Let 


_)* 


Ai= 


l 


A(x,, 1). 


1 


Then A is an affine subspace containing G. In fact, A = aff G. To see this, 
note first that (8) shows that G is a polyhedral set in A with the representation 


G= ()\ K(x;,1)AA 
t=k+1 
(where, of course, we may have K(x;, 1) 7 A = A for certain values of i). 
Nowy, it is clear that the non-empty intersection G of closed halfspaces in the 
affine space A can only have a smaller dimension than A itself if G is con- 
tained in a hyperplane bounding one of the halfspaces K(x;, 1) 0 A, i = 


k + 1,...,n. But ifGisa subset of H(x,, 1),thenx;¢€G° = F,acontradiction. 
In conclusion, A = aff G, whence in particular 
dimG = dim A 
= dim ( \ H(x, 1). 
xeF 
Hence, in the proof of Theorem 6.10 we have equality in (9). The rest of the 
proof of Theorem 6.10 then yields the desired formula. LJ 


For another proof of Theorem 9.8, see Section 10. 
Note that the proof of Corollary 9.5 could also have been based on 
Theorem 9.8. 


EXERCISES 


9.1. A section of a polytope P in R® is the intersection of P and some affine subspace of 
R‘. Show that every polytope P with n facets is a section of an (n — 1)-simplex. 
(This is to be understood as follows: “Embed” P in R"~'; construct an (n — 1)- 
simplex S in R"~! such that S q aff P = P. Hint: One may use Exercises 6.5 and 
7.1.) 


9.2. Let P and Q be mutually polar convex polytopes in R%, and let F and G be conjugate 
faces of P and Q, respectively. Show that G = Qc. H(x, 1) if and only if x eri F, 
cf. Exercise 6.6. 
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§10. Equivalence and Duality of Polytopes 


It may be said that the combinatorial theory of convex polytopes (which is 
the subject of Chapter 3) is the study of face-lattices of convex polytopes. So, 
from this point of view, there is no need to distinguish between polytopes 
whose face-lattices are isomorphic. This leads to the notion of equivalent 
polytopes. 

In Section 6 we developed a polarity theory of convex sets, and in Section 9 
we applied it to obtain basic properties of convex polytopes. It is a funda- 
mental fact that for a pair P, Q of mutually polar polytopes, the /,-operation 
induces an anti-isomorphism of the face-lattices. Accepting the point of view 
explained above (leading to the notion of equivalence), it follows that there 
is no need to distinguish between Q and any polytope whose face-lattice is 
isomorphic to that of Q. These polytopes, however, are just the polytopes 
whose face-lattices are anti-isomorphic to that of P. This leads to the notion 
of dual polytopes. 


Two polytopes are said to be equivalent (and each is said to be an equivalent 
of the other) if their face-lattices are isomorphic. Clearly, this is an equiva- 
lence relation. The image g(P) of a polytope P under an affine isomorphism 
g is an equivalent of P; but in general there are many other equivalents of P. 


Theorem 10.1. Let P and Q be equivalent polytopes with dim P = d, and let 
p:(F(P), <) + (F(Q), ©) 
be an isomorphism. Then 
dim Q = d, 
and 
dim ~(F) = dim F 
for any face F of P. 
PROOF. By Corollary 9.7, each face F of P is a member of a chain 
O=F_,¢S---GF,E---SFy=P (1) 
of faces of P with 
dim F; = i, i= —l,...,d. 
Since @ is an isomorphism, (1) yields 
O=PF-)S--- SOF) S--- & OF = Q. (2) 
Now, (2) implies 
dim g(F;+.1,) = dim g(F;) + 1, i= —l,...,d—-1, (3) 
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cf. Corollary 5.5. This clearly implies dim g(F,) > d, 1.e. 
dim Q > d = dim P. 


But P and Q play completely symmetric roles, and therefore we must actually 
have 


dim Q = d, 


as desired. This, in turn, implies that we must have equality in (3) for all i, 
enforcing 


dim g(F;) = i, i= —l,...,d. 
This completes the proof. LJ 
Two polytopes are said to be dual (and each 1s said to be a dual of the 
other) if their face-lattices are anti-isomorphic. We note that when P and Q, 
are dual, then P and Q, are also dual if and only if Q, and Q, are equivalent. 
The question of existence has almost been settled by Corollary 6.8: 
Theorem 10.2. For any polytope P, there is a dual polytope Q. 


Proor. For any d-polytope P there is a d-polytope P, in R* with o€ int P, 
such that P and P, are equivalent. Corollary 6.8 shows that Q, := P{ is a dual 
of P,. But then Q, is also a dual of P by the equivalence of P and P,. LJ 


The next theorem is closely related to Theorem 9.8, see the remarks below: 


Theorem 10.3. Let P and Q be dual polytopes with dim P = d, and let 
Wi(F(P), ©) > (F(Q), ©) 
be an anti-isomor phism. Then 
dim QO = d, 
and 
dim W(F) = d — 1 — dim F 
for any face F of P. 


Proor. As in the proof of Theorem 10.1, we use the fact that each face F of P 
is a member of a chain 


O=F..E--GFE--SFy=P (4) 
of faces of P with 
dim F; = 1, i= —l,...,d. (5) 
Since yw is an anti-isomorphism, (4) yields 


Q=WF_)2:° 2WF)2°:: 2WFd) = D. (6) 
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Now, (6) implies 
dim W(F;) > dim W(F;.,) + 1, i= —l,...,d—1. (7) 
This clearly implies dim w(F _,) > d, Le. 
dim Q > d = dim P. 
The symmetry of P and Q then ensures that we must have 
dim Q = d, 


proving the first statement. This, in turn, implies that we must have equality 
in (7) for all i, whence 


dim W(F,) =d—1—-i 
~d—1—dimF,, i= —1,...,d, 


completing the proof. = 


It is clear that Theorem 9.8 is an immediate consequence of Theorem 10.3. 
On the other hand, Theorem 10.3 could also have been deduced from Theorem 
10.1; just observe that when P and Q are dual with dim P = d, then there isa 
pair P,, Q, of mutually polar d-polytopes such that P is equivalent to P, 
and Q is equivalent to Q,. 

We next prove two important theorems on the facial structure of poly- 
topes; their proofs depend on Theorem 10.3. 


Theorem 10.4. Let P be a d-polytope, and let F be a proper face of P. Then F 
is the intersection of the facets of P containing F. If F is a k-face, then for 
k =0,1,...,d — 3 there are at least d — k such facets, for k = d — 2 there 
are exactly 2 (=d—k) such facets, and for k = d — 1 there is exactly 1 
(= d — k) such facet. 


PRrooF. Let Q be a dual of P, and let w be an anti-isomorphism from (¥(P), <) 
onto (F(Q), <). Let F be a k-face of P, and let G := W(F). Then 


dimG=d—1-—k 
by Theorem 10.3. But then G has at least 
(d-—1—-—k)+1=d—k 


vertices. For k = d — 2 and k = d — 1, the number of vertices is exactly 
d — k, since 1-polytopes have two vertices and 0-polytopes have one vertex. 
We now apply the anti-isomorphism y~' from (¥F(Q), <) onto (F(P), <). 
The dimension formula of Theorem 10.3 shows that vertices of Q correspond 
to facets of P. Therefore, since G is the smallest face of Q containing the 
vertices of G, it follows that F is the intersection of the facets of P containing F. 
This proves the statement. C] 
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Theorem 10.5. Let P be a d-polytope, and let x be a vertex of P. Then there are 
at least d edges of P containing x. 


PROOF. Let QO be a dual of P, and let w be an anti-isomorphism from (F(P), <) 
onto (F(Q), <). Let G := w({x}); then by Theorem 10.3, G is a (d — 1)-face 
of Q. Corollary 9.5 next shows that the number of (d — 2)-faces of G is at 
least d. But then by duality the number of 1-faces of P containing x is at 
least d, cf. Theorem 10.3. LJ 


Using terminology from graph theory (cf. Appendix 2 and Section 15), 
we Shall say that two distinct vertices of a polytope P are adjacent if the seg- 
ment joining them is an edge of P, and we shall say that a vertex and an edge 
are incident if the vertex is a vertex of the edge. With this terminology, 
Theorem 10.5 states that the number of vertices adjacent to x, or the number 
of edges incident to x, is at least d. 

For any d-polytope P, let fP) denote the number of j-faces of P, where 
j= —1,0,.... Thus f_,(P) = f,(P) = 1 and f(P) = 0 for j > d. Ford > 1, 
the d-tuple 

F(P) = (fo(P), «+s fa-1¢P)) 
of positive integers is called the f-vector of P. This concept will play a central 
role in Chapter 3. Here we note an immediate corollary of Theorems 10.1 
and 10.3: 


Corollary 10.6. Let P be a d-polytope (where d > 1). Then for any polytope 
P, equivalent to P we have 


f(Py)= (fo(P), --- » fa—1(P)), 
i.e. f(P,) = f(P), and for any polytope Q dual to P we have 


f(Q) = (fa-1€P), «+ -, fo(P)). 


EXERCISES 
10.1. Show that an equivalent of a d-pyramid is again a d-pyramid. 
10.2. Show that a dual of a d-pyramid is again a d-pyramid. 


10.3. Show that if Q, and Q, are equivalent polytopes, then any pyramid P, with Q, 
as a basis is equivalent to any pyramid P, with Q, as a basis. 


10.4. Show that the statement of Exercise 10.1 is not valid in general for bipyramids. 
10.5. Show that any bipyramid has prisms as well as non-prisms as duals. 


10.6. Let P, and P, be polytopes, let g’ be a one-to-one mapping from the vertices of 
P, onto the vertices of P,, and let ~” be a one-to-one mapping from the facets of 
P, onto the facets of P,. Assume that y’(x) is a vertex of y"(F) if and only if x 
is a vertex of F. Show that there is an isomorphism g from (¥(P,), <) onto 
(¥(P,), <) which extends both g’ and ¢”. In particular, P, and P, are equivalent. 

State and prove an analogous dual statement. 


$11. Vertex-Figures 67 


10.7. Use a duality argument to show that no vertex of a polytope P is contained in all 
the facets of P. 


10.8. Let F, and F, be faces of a d-polytope P such that F, < F, and dim F, < 
dim F, < d — 1.Showthat there isa face F, of Psuch that dim F; = dim F, +1, 
F, < F,and F, ¢ F,. (Hint: Exercise 10.7 may be useful.) 


10.9. Let P be a d-polytope. Show that for j < k < d — 1, any j-face F of P is the 
intersection of the k-faces of P containing F. 


10.10. Let P be an octahedron and let Q be a 3-polytope obtained by “adding pyramids” 
over two of the facets of a 3-simplex. Show that f(P) = {(Q). Show that P and Q 
are non-equivalent. 
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The vertex-figures of a d-polytope P are certain (d — 1)-polytopes, each 
containing information about the “local” facial structure of P “near” one of 
its vertices. In this section we have collected some results dealing with or 
related to vertex-figures. 


We first study the facial structure of a non-empty intersection H m P ofa 
d-polytope P in R* and a hyperplane H in R*. Note that when H does not 
intersect int P, then H must be a supporting hyperplane by Theorem 4.1, and 
H © P is then simply a face of P (whose facial structure is known when the 
facial structure of P is known). 


Theorem 11.1. Let P be a d-polytope in R‘, and let H be a hyperplane in R* 
such that 


Hoint P #~ @. 
Then the following holds: 


(a) The set P’:= Hc P is a(d — 1)-polytope. 

(b) Let F bea face of P. Then F':= H ca F isa face of P’,and dim F’ < dim F. 
If F # @and H is not a supporting hyperplane of F (i.e. F’ is not a face of F 
and hence not a face of P), then dim F’ = dim F — 1. 

(c) Let F’ be a face of P’. Then there is at least one face F of P such that 
F’ = H © F, and for each such face F we have dim F > dim F’. 

(d) Let F' be a face of P’. If F’ is not a face of P, then there is one and only 
one face F of P such that F’ = H c F, and for this face F we have dim F = 
dim F’ + 1. 


PROOF. (a) The set P’ is a polytope by Corollary 9.4. It is clear that the 
dimension of P’ isd — 1. 


68 2. Convex Polytopes 


(b) It follows immediately from the definition of a face that F’ is a face of 
P’, and it is clear that dim F’ < dim F. If F # @ and H 1s not a supporting 
hyperplane of F, then we have F ¢ H and HoriF # ©, cf. Theorem 4.1. 
But then H o- aff F is a hyperplane in aff F intersecting the interior of F in 
aff F, whence dim(H - F) = dim F — 1. 

(c) For F’ = @, the statement is obvious. For F’ # @, we first note that 
it is trivial that dim F > dim F’ when F 1s a face of P such that F’ = HF. 
To prove the existence of such a face, let x) Eri F’ and let F, be the smallest 
face of P containing x,. Then x, €ri Fo, cf. Theorem 5.6. We shall complete 
the proof by showing that 


F =H Fo. (1) 


Let ye F’ with y # x,.. Then there exists a point ze F’ such that x9 € |y, z[, 
cf. Theorem 3.5, (a) = (c). Since Fy, is a face of P containing xo, it follows that 
yeéF, (and ze F,). Since F’ c H, this proves < in (1). Conversely, let 
yeHo Fowithy # Xo. Then there exists a point z € Fy, such that x9 € Jy, 2[, 
cf. Theorem 3.5, (a) = (c). Since x, and y are in H, then z must also be in H, 
whence y, z€ P’. Since F’ is a face of P’ containing Xo, it follows that ye F’ 
(and ze F’). This proves => in (1). 

(d) For F’ = @, there is nothing to prove. For F’ # @, we refer to the 
proof of (c). Let F be any face of P such that F’ = Hq F. Then F must contain 
the point x,, and hence Fy c F by the definition of Fy. Corollary 5.5 next 
shows that if F, & F, then dim F, < dim F. We shall complete the proof by 
showing that 


dim Fy = dim F 
= dim F’ + 1. 


Let G be any face of P such that F’ = HG. Since F’ 1s not a face of P by 
assumption, statement (b) applied to G gives dim G = dim F’ + 1. Since this 
applies to both G = Fy and G = F, we have the desired conclusion. LJ 


Let x, be a vertex of a d-polytope P in R* (where d > 1). Then there is a 
supporting hyperplane H(y, «) of P such 


H(y, «) 0 P = {Xo}. 


Assuming that P < K(y, «), it then follows that for some f < a, all the 
vertices of P except x, are in K(y, B)\H(y, B), whereas xg is in R“\ KG, B). 
In other words, there is a hyperplane H which separates x, from the remaining 
vertices of P in the sense that x, is on one side of H and the remaining 
vertices are on the other side of H. By a vertex-figure of P at X) we mean a 
set H 7 P (in fact, a (d — 1)-polytope, cf. Theorem 11.2), where H is a hyper- 
plane separating x, from the remaining vertices of P. 

When x, is a vertex of P, we denote by F(P/x,) the set of faces of P 
containing x,. It is clear that (F(P/x,), <) is a sublattice of (F(P), <). This 
lattice is “essentially” the face-lattice of the vertex-figures of P at xo: 
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Theorem 11.2. Let P be a d-polytope in R‘, let xq be a vertex of P, and let 
P’=HOP be a vertex-figure of P at X,. Then P’ is a (d — 1)-polytope. 
Furthermore, the mapping 


FReF':=HOF 


is an isomorphism from the sublattice (F(P/xo), <) of (F(P), <) onto the 
lattice (F(P’), <). 


Proor, The hyperplane H intersects P and is not a supporting hyperplane. 
Therefore, 


HomtP4#@ 


by Theorem 4.1. Then P’ is a (d — 1)-polytope by Theorem 11.1(a). 
It follows from Theorem 11.1(b) that the mapping 


FReF :=HOF 


maps ¥(P/x,) into F(P’), and it follows from Theorem 11.1(d) that the 
mapping is one-to-one and onto. Finally, it is trivial that the mapping 
preserves inclusions. LI 


Corollary 11.3. Let P be a d-polytope in R*, and let xo be a vertex of P. Then 
any two vertex-figures of P at Xo are equivalent. 


Proor. In fact, if P’ and P” are vertex-figures of P at xj, then (¥(P’), <) and 
(F(P"), <) are both isomorphic to (F(P/x9), <), and therefore mutually 
isomorphic. LJ 


Of course, Corollary 11.3 can also be proved by an argument based 
directly on Theorem 11.1. 

Now, let F, and F, be faces of ad-polytope P such that F,; < F,.Then the 
set of faces F of P such that 


F,cFcF, 


will be denoted by F(F,/F,). When F, = {xo} and F, = P, we recover 
¥(P/X). It is clear that (F(F,/F ,), <) is a sublattice of (F(P), <). It follows 
from Theorem 11.2 above that in the particular case where F, is a vertex and 
F, = P, the lattice (F(F,/F,), <) can be “realized” as the face-lattice of a 
suitable polytope. This is true in general (except, of course, when F, = F): 


Theorem 11.4. Let P be a polytope, let F , be a j-face of P, and let F, be ak-face 
of P such that F, & F,. Then there is a (k — 1 — j)-polytope Q such that 
(F(F,/F,), <) is isomorphic to (F(Q), <). Furthermore, for every isomor- 
phism from (F (F ,/F,), <) onto (F(Q), <) we have 

dim g(F) = dim F — 1 — j 


for any face F € F(F2/F;). 
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PRoor. We consider F, as a j-face of the k-polytope F,; the faces F of P 
belonging to F(F,/F,) are then the faces F of F, such that F,; c F. Let G, 
be a dual polytope of F,, and let w be an anti-isomorphism from (¥ (F,), <) 
onto (F(G,), <). The faces of G, corresponding under w to the faces F of 
F, with F, c F are then the faces G = W(F) of G, such that 


O=WF,)- G = WF) c W(F,). 


This shows that the restriction of W to (F(F,/F,), <) is an anti-isomorphism 
from (F(F,/F,), <) onto the face-lattice (F(wW(F,)), <) of the polytope 
W(F,). Therefore, if we take Q to be any dual polytope of w(F,), we see that 
(F(F,/F,), <) 1s isomorphic to (F(Q), <). 

To determine the dimension of Q, note that 


dim Q = dim W(F,) 
by Theorem 10.3. But 
dim W(F,) =k —-—1—dimF, 
=k—1-j 
by the dimension formula of Theorem 10.3. Hence, 
dimQ=k—1—-j, 


as desired. 
Finally, let @ be any isomorphism from (¥(F,/F,), <) onto (F(Q), <). 
Every face F € ¥(F,/F,) is a member of a chain 


FPF, =G,;S°:°-SG;ES°'°-SG,=F, 
of faces G;€ ¥F(F,/F,) with 
dim G,; = i, i=j,...,k, 
cf. Corollary 9.7. Application of @ yields the chain 
OD = PFi1) = PG) E--- |S PG) |S--- |S PG) = O(F2) = Q. 
This implies 
—1=dim 9g(G,) < --- < dim g(G;) < --- < dim ¢(G,) 
=dmQ=k-—-1—j, 


where we have used Corollary 5.5 and the expression for dim Q found above. 
This in turn enforces 


dim g(G,;) =i-—1-—j, i=j,...,k, 
whence 
dim ~(G;) = dim G; — 1 — j, i=j,...,k. 
This completes the proof. LJ 
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We note the following: 


Theorem 11.5. Let P and Q be dual d-polytopes, and let 
W:(F(P), <) > (F(Q), ©) 


be an anti-isomorphism. Let Xo be a vertex of P, and let P’ be a vertex-figure 
of P at Xo. Then the facet W({Xo}) of Q is a dual of P’. 


Proor. We know by Theorem 11.2 that (F(P’), <) and (¥(P/xo), <) are 
isomorphic. Taking F,; = {x } and F, = P in the proof of Theorem 11.4, we 
see that (¥(P/x,), <) is isomorphic to (A(R), <), where R is any polytope 
dual to W({x,}). (This polytope R is denoted by Q in the proof of Theorem 
11.4.) Therefore, (F(P/x,), <) is anti-isomorphic to (F(W({xo})), <), as 
desired. LI 


Let x, be a vertex of a d-polytope P. Vertex-figures of P at xq arise from 
hyperplanes H separating x, from all the remaining vertices of P. We shall 
prove that in order to have H separating x, from all the remaining vertices of 
P it suffices to have H separating x, from those vertices of P that are adjacent 
to X,.. We first prove: 


Theorem 11.6. Let P be a d-polytope in R*, let xo be a vertex of P, and let 
X1,-++5 X, be the vertices of P adjacent to Xp. Let H(y, «) be a hyperplane in R4 
such that Xo € H(v, «) and x,,...,x,€ K(y, «). Then P < K(y, «) i.e. H(y, %) 
is a supporting hyperplane of P. If, in addition, we have X,,..., xX, € H(y, &), 
then H(y, «) OP = {xo}. 


Proor. Let P’ = H’ cn P bea vertex-figure of P at x), determined by a hyper- 
plane H’ separating x, from the remaining vertices of P. Theorem 11.1(b), 
(d)—or Theorem 11.2—tells that the vertices of P’ are the 1-point sets 
[xo.xX;)] OH’, i=1,...,k. Since both xo and x,,..., x, are in K(y, «) by 
assumption, it follows that the vertices of P’ are in K(y, «), and, therefore, P’ 
is in K(y, @). 

Let x be any vertex of P with x # x9. Then x and x, are on opposite sides 
of the hyperplane H’, whence [x9, x] © H’ is a 1-point set, say {x’}. Since 
xe P’ and P’ c K(y, a), it follows that x’ K(y, «). This, in turn, clearly 
implies x € K(y, «). In other words, all the vertices of P are in K(y, a), whence 
Pc Ki, &). 

If, in addition, x,,..., x, do not lie in H(y, «), then ]xo, x;[ < int K(y, a) 
fori = 1,..., k. So, all the vertices of P’ belong to int K(y, «), and therefore 
P’ c int K(y, a). For any vertex x of P, x # x9, we then have x’ € int K(y, «), 
implying that x € int K(y, «). (Here, as above, x’ denotes the single point in 
[x9, X] 0 H’.) This shows that the only vertex of P in the exposed face 
H(y, «) 0 P is Xo, implying that H(y, 0) 0 P = {xo}, ef. Theorem 7.3. LJ 
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Corollary 11.7. Let P be a d-polytope in R‘, let x. be a vertex of P, and let 
X1,..., X, be the vertices of P adjacent to x). Then 


aff {xo, X15 ey x,} —_ R4 


ProoF. If the desired conclusion is not valid, then there is a hyperplane H 
containing X9, X;,..., X,. Then both of the two closed halfspaces bounded 
by H contain X9,X;,...,x,. By Theorem 11.6 this implies that P is contained 
in both of these halfspaces, whence P < H. This contradiction completes 
the proof. C] 


We can now prove: 


Theorem 11.8. Let P be a d-polytope in R‘, let x9 be a vertex of P, and let 
X1, +++, X, be the vertices of P adjacent to x9. Let H be a hyperplane in R? 
separating Xo from X,,...,X,. Then H separates Xo from any other vertex of P, 
whence H - P is a vertex-figure of P at Xo. 


PROOF. Given the vertex xg and its adjacent vertices x,,..., x,, let x be any 
other vertex of P. Let L denote the line through x, and x. We first prove that 
|x, x[ intersects the set 


QO, i conv{x,, sey X,}. 


Let H, bea hyperplane in R¢ orthogonal to L, and let z: R4 + H, denote the 
orthogonal projection. Letting 


0 = CONV{Xg, X15 sees Xb 
it is clear that 
T™(Qo) = CONV{N(Xg), M(X1), -.- 5 T(X,)}- 


In particular, 2(QQ) is a polytope with 


ext 2(Qo) < {n(Xo), M(X,),..., 1(X,)}, 


cf. Theorem 7.2, (a) = (b). Suppose that (x9) is a vertex of 2(Q,). Then there 
is a supporting hyperplane H, of x(Q,) in H, with H, A n(Qo) = {x(x,)}, cf. 
Theorem 7.5. But then x” '(H,) = aff(H, UL) is a supporting hyperplane 
of Qy in R¢ with x,,...,x,¢2 ‘(H,) and xen '(H,), contradicting the 
second statement of Theorem 11.6. Hence, z(x9) is not a vertex of 2(Q,). 
This implies that 

MX) €conv{m(x,),..., a(x,)}, 


cf. Theorem 7.2, (b) => (a). Since 


mM(CONV{X,,..., X,}) 
™Q,), 
it follows that z(x9) € 2(Q,), implying that L intersects Q,. However, since 


Xo and x are vertices of P, and Q, is asubset of P, every point of L in Q, must 
lie between x, and x. 


conv{m(x,),..., (x,)} 


| 
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To complete the proof, let H be a hyperplane separating x, from X,,..., Xx; 
and let K be that closed halfspace bounded by H which contains x,,..., X,- 
Then x,,..., x, belong to int K, whence also Q, c int K. Let x be any other 


vertex of P. Using what we have proved above, we see that at least one point 
of ]xo, x[ is in int K. By the convexity of R“\int K we must then also have 
x € int K, as desired. L 


The next theorem has an interesting application in the proof of Theorem 
11.10. 


Theorem 11.9. Let P be a d-polytope in R* (where d > 1), and let xq be a 
vertex of P. Then there is a point xo € int P such that the hyperplane through 
x) with Xo — Xo as anormal separates Xo from the remaining vertices of P. 


PRoor. We may assume that x, = o. Let x,,..., x, be the remaining vertices 
of P, and let 

P’ := conv{x,,..-, Xn}- 
Then P’ is a polytope with vertices x,,..., X,, and X, is not in P’. Letve P’ 


be such that 
<v, v> = min{<x, x>|xe P’}. (2) 
The existence of v follows by noting that the mapping 
xh Cx, X> = ||xI|? 


is continuous on the compact set P’. (The point v is in fact the unique point 
of P’ nearest to o.) Since o ¢ P’, it follows that 


0 < <v, v). (3) 
We claim that 
<v, v> = min{<v, x>|xe P’}. (4) 


(Hence H(v, «) with a := <v, v> is a supporting hyperplane of P’ at v.) To see 
this, let x e P’ and let Ae JO, 1[. Then Ax + (1 — A)v is in P’, whence by (2) 


<v,v> < <Ax + (1 — Ajo, Ax + (1 — A)vD 
= (v, v> + 2A(<v, x) — <v, vd) + A*KQv — xX, — Xd. 
Re-arranging and dividing by 2A yields 
<u, v> — Cv, x) S (A/2)Kv — x, v — x). 


This holds for Ae ]0, 1[. By continuity it must also hold for A = 0, ie. (4) 
holds. Now, (3) and (4) imply 


<v, x;> > 0, i=l,...,n. 
By continuity we then have 


<u, xX;> > 9, i=1l,...,n (5) 


74 2. Convex Polytopes 


for all u belonging to some ball B(v, ¢). In particular, o is not in B(v, «). Let 
Uo be a point in Biv, €) F int P, cf. Theorem 3.4(c). Then H(up, 0) is a hyper- 
plane through o with all the vertices x,,..., x, strictly on one side. Therefore, 
for J sufficiently small, 0 < A < 1, the hyperplane H parallel to H(uy, 0) 
through xo := Au, separates o from x,,..., x,. Finally, it is clear that H has 
Xo — Xo (= Xo = Auy) as a normal, and it follows from Theorem 3.3 that 
Xo € int P, since xo € Jo, uof. LI 


The next theorem is an application of Theorem 11.9. The theorem illu- 
strates how the polar operation can be used to produce polytopes equivalent 
to a given polytope with desirable properties. The proof is based on the 
observation that if P and x + P both have o as an interior point, then P° 
and (x + P)° must be equivalent since P° is a dual of P, (x + P)° is a dual 
ofx + P,and Pand x + Pare equivalent. Theorem 11.10 is needed in Section 
19. 


Theorem 11.10. Let P be a d-polytope in R‘, and let F be a facet of P. Then 
there is ad-polytope P, in R* equivalent to P such that the orthogonal projection 
of R* onto the hyperplane spanned by the facet F, of P, corresponding to the 
facet F of P maps P,\F, into ri F,. 


PROOF. We may assume that oeint P. Let Q:= P°; then Q° = P. Let yy 
be the vertex of Q conjugate to F, cf. Theorem 9.8. Let y,,..., y, be the re- 
maining vertices of Q@. Use Theorem 11.9 to get yj € int Q such that 


(Vix Yo — Yor < “Vos Yo — Yoo < “Vo, Yo — Yoo: i=1,...,n. 


Take Q, := Q — yo; then Q, is ad-polytope with o in its interior. The vertices 
of Q, are the points y; — yo,i = 0,..., n. Take P, := Q§. Since Q and Q, are 
equivalent, it follows that P and P, are equivalent (under an obvious lattice 
isomorphism). The facet F, of P, corresponding to F, of course, is the facet 
of P,; conjugate to the vertex yo — yo of O,. Hence, 


aff F; = H(yo — Yo, 1). 
For xé€R*, the orthogonal projection of x onto aff F, is the point x’ = 
x + A(Yo — Yo), where A is determined by 
1 = «X’, Yo — Yo? 
= «%; Yo — Yoo + ACVo — Yo Yo — Yor: 


_1= <x ¥0 — yo 
Yo — Voll” 
Now, by Theorem 9.1(a), 
P, = ia K(y; — Yo, 1). 


i=0O 
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Hence, for xe P, we have <x, y; — Yo) < 1 fori = 0,..., n, and we have 
<x, Vo — Yo < 1 for xe P,\F,. For x € P,\F, and x’ as above we then get 
fori=1l,...,n, 


{x', Vi — YOO = (X%, Vi — YoX + A4Vo — Yoo Vi — Yor 
< 1+ 2y¥o — Yo. Vi — Yor 
< 1, 


since A > O and <yo — yo, Vi; — Yo» < 0. This shows that 
x’ €int K(y; — Yo, 1) 


fori = 1,...,n, whence 


x’ eint (| K(y; — yo. LD). 
i= 1 


“= 


Since 


F, = H(vo — Yo. YO (\ KOi — Yo. 1), 
i=1 
it follows that x’ eri F,, as desired. LI 


In Theorem 11.1 we described in great detail the facial structure of a 
polytope of the form P’ = H - P, where H isa hyperplane and P is a polytope 
whose interior is intersected by H. In a similar way we can describe the facial 
structure of a polytope of the form P’ = K m P, where K is a closed halfspace 
and P is a polytope whose interior is intersected by the hyperplane H 
bounding K. We mention a particular case. 


Theorem 11.11. Let P be a d-polytope in R4, let H be a hyperplane in R* with 
H aint P # @, HoextP= ©, 
and let K be one of the two closed halfspaces bounded by H. Then we have: 


(a) The set P’:= K © P is ad-polytope, and H 2 P is a facet of P’. 

(b) Let F be a face of P such that K 0 F # @. Then F’:= K 1 F is a face of 
P’, and dim F’ = dim F. 

(c) Let F' bea face of P’. Then either F' is a face of the facet H ~ P, or there isa 
unique face F of P such that F'’ = Ko F. 


PROOF. (a) This is obvious, cf. Corollary 9.4. 

(b) Itis obvious that F’ is a face of P’. If F < K,then the dimension formula 
is trivial. If F ¢ K, then there must be points of F on both sides of H; for if 
not, then H would be a supporting hyperplane of F with F < (R“\K) U H, 
contradicting the assumption that H contains no vertex of P. But then H 
must intersect ri F, cf. Theorem 4.1, (a) = (b). This in turn clearly implies 
dim F’ = dim F. 
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(c) Suppose that F’ is not a face of Hm P. Then F’ is not a subset of 
H © P,cf. Theorem 5.2, whence H cn ri F’ = @, cf. Theorem 4.1, (b) = (a). 

We first prove uniqueness of F. If F, and F, were faces of P such that 
F’=KOF,=KoOF,, then we would also have F’ = KO (F,OF,)). 
By (b) we would then have dim F, = dim F, = dim(F, 2-9 F,), implying 
that F, = F,, cf. Theorem 5.2 and Corollary 5.5. 

To prove existence of F, take x,erifF’; then xg¢€ Pint K since 
HoriF' = @. Let H’ bea supporting hyperplane of P’ such that H’ 7 P’ = F’. 
Then H' is also a supporting hyperplane of P. For if some point y of P not in 
K were on the wrong side of H’, then the entire segment |x), y] would be 
on the wrong side; but ]x9, y] contains a whole segment of points from 
K © P = P’, whence H’ could not be a supporting hyperplane of P’. Hence, 
H’ is a supporting hyperplane of P. But then F := H’ 1 P is a face of P with 
K OF = F’,as desired. [ 


If in Theorem 11.11 the set of vertices of P not in K are the vertices of a 
face F, then the polytope P’ is said to be obtained from P by truncation 
of the face F. The operation of truncation produces one “new” facet. 
The old facets of P all “survive”, except of course F itself, if F is a facet. The 
dual operation of truncating a facet is called pulling a vertex. It consists in 
taking the convex hull of the polytope and one “new” vertex (outside the 
polytope) such that one “old” vertex disappears. The dual operation of 
truncating a vertex is that of adding a pyramid over one of the facets. A precise 
description of the duality can be given in terms of polarity as explained in 
Theorem 9.1. 


EXERCISE 


11.1. Let F,, F,, and F, be faces of a polytope P such that F; & F, & F,. Let Q bea 
polytope such that (¥(F3/F,), <) is isomorphic to (¥(Q), <) under the iso- 
morphism g. Verify that (F(F,/F,), <) is isomorphic to (F(g(F>)), <), and 
(F(F3/F,), <) 1s isomorphic to (F(Q/p(F>)), <). 
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In this section we introduce two important classes of polytopes, namely, the 
simple polytopes and the simplicial polytopes. Both classes are defined by 
“non-degeneracy” conditions; actually, the conditions are dual. The “non- 
degeneracy” makes these polytopes much easier to handle than polytopes in 
general; in fact, with one important exception, the combinatorial theory to 
be developed in Chapter 3 deals only with simple and simplicial polytopes. 

Because of the duality there is no formal reason to prefer one of the two 
classes to the other. However, certain problems are treated most conveniently 
in terms of simple polytopes. 
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We remind the reader that an e-polytope is an e-simplex provided that its 
vertices form an affinely independent (e + 1)-family, cf. Sections 2 and 7. We 
begin with a discussion of the facial structure of simplices. 


Theorem 12.1. Let S be an e-simplex in R‘, and let F be a proper face of S. 
Then F is also a simplex. 


PROOF. The vertices of F are those vertices of S which are in F, cf. Theorem 7.3. 
Any subfamily of an affinely independent family of points is itself affinely 
independent. Therefore, since F is the convex hull of its vertices, cf. Theorem 
7.2(c), it follows that F is a simplex. LI 


Theorem 12.2. Let S be an e-simplex in R*%, let X be a non-empty subset of 
ext S, and let F = conv X. Then F is a face of S, and ext F = X. 


Proor. Let ext S = {x,,...,X.+,}, and let us assume that X = {x,,...,X,}: 
To prove that F is a face of S, we shall show that if yp and y, are two points 
of S such that for some t € ]0, 1[, the point 


Y, =(1 — tyo + ty 
is in F, then y, and y, must be in F. Each x in S has a unique representation 


e+1 


x= ys A; X;- (1) 
i=1 


Points x from S§ actually belonging to F are characterized by the property 
that A; = Ofori=k+1,...,e + 1. Now, we have 


e+1 
__ c 
Yo = y hoi Xi 
i=1 
and 
e+] 
__ c 
yi= y Ay iXis 
i=1 
whence 


e+l 


Y= DSC = to; + tAydx:. 
i=1 
But we also have yj, € F, 1.e. 
k 
y= > Ari Xi- 
t= 1 


By the uniqueness of representations (1) we then get 


(1 — t)dg; + taA,; = 9, i=k+1,...,e+ 1. 
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This clearly implies 
Ao; = 41; = 9, i=k+1,...,e4+1, 


whence yp and y, are in F, as desired. 
Finally, Theorem 7.2, (a) = (b) shows that 


ext F< {x,,..., X;,}. 
The opposition inclusion is clear, cf. the “only if” part of Theorem 5.2. (J 


The two theorems above contain the basic information about faces of 
simplices. We have the following corollaries: 


Corollary 12.3. Let S be an e-simplex in R*, and let F be a j-face of S, where 
—1 <j<e. Then for k =j,...,e, the number of k-faces of S containing F 


equals 
e—J 
k—j} 


Proor. By Theorems 12.1 and 12.2 there is a one-to-one correspondence 
between the k-faces of S containing F, and the choices of (k + 1) -G+ 1) 
vertices from the (e + 1) — G+ 1) vertices of S not in F. This proves the 
assertion. C 


Corollary 12.4. Let S be an e-simplex in R*. Then for —1 < k < e, the number 
of k-faces of S equals 

e+1 

k+1} 


PROOF. Take j = —1 in Corollary 12.3. CJ 
Corollary 12.5. Let S be an e-simplex in R*, and let F be a k-face of S, where 
—1<k <e. Then the number of facets of S containing F equals e — k. 


ProoF. It follows from Corollary 12.3 that there are 


e—k 
(ane eee 


facets of S containing a given k-face F. This proves the assertion. LJ 


In Corollary 12.5, note that F is the intersection of the e — k facets con- 
taining F, cf. Theorem 10.4. Conversely, the intersection of e — k facets is a 
k-face: 


Corollary 12.6. Let S be an e-simplex in R%, and let F,,...,F.-, bee —k 
facets of S, where -1 <k<e-—1. Then F,---OF,_, is ak-face of S. 
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PROOF. Let x,,..., X-+, be the vertices of S. By Theorems 12.1 and 12.2, each 
F , is the convex hull of certain e of the e + 1 vertices. We may assume that 


j= CONV({X4,~--, Xo4if\LXjH), j=l,...,e—k. 
Then a point x in S is in F, if and only if in the (unique) representation 


e+1 


x= S°A;X; 
i=1 


we have A, = 0. Therefore, x is in Fy V---O F,-, if and only if A, = --- = 
A.-, = 0, i.e. if and only if x is in the set 


CONV{X,— pes - +> Xe41}- 


But this set is a face of S by Theorem 12.2, and its dimension is k by 
Theorem 12.1. LJ 


Corollary 12.7. Let S be an e-simplex in R‘, and let T be a dual e-polytope. 
Then T is also an e-simplex. 


ProoF. It follows from Corollary 12.4 that S has e + 1 facets. Dually, T has 
e+ 1 vertices, cf. Theorem 10.3. But e-polytopes with e + 1 vertices are 
simplices. CJ 


Corollary 12.8. Let P be an e-polytope in R*. Then P is an e-simplex if and only 
if the number of facets of P is e + 1. 


ProoF. If P is an e-simplex, then P has e + 1 facets by Corollary 12.4. Con- 
versely, if P is an e-polytope with e + 1 facets, then any dual Q of P is an 
e-polytope withe + 1 vertices, cf. Theorem 10.3. Hence, Q is an e-simplex, and 
therefore, by Corollary 12.7, P is also an e-simplex. 


We shall move on to the simplicial and simple polytopes. 

A d-polytope P is said to be simplicial if for k = 0,...,d — 1, each k-face 
of P has precisely k + 1 vertices (i.e. each proper face of P is a simplex). 

Any simplex 1s simplicial, cf. Theorem 12.1, but of course there are many 
other simplicial polytopes. 

In the definition of a simplicial polytope it suffices to require that all facets 
are simplices: 


Theorem 12.9. A d-polytope P is simplicial if (and only if) each facet of P is a 
simplex. 


ProoF. Let F be a proper face of P. By Corollary 9.7 there is a facet G of P 
containing F. Then F is a face of G, cf. Theorem 5.2, and since G is a simplex 
by assumption, F is a simplex by Theorem 12.1. CO 


Let F be a k-face of a d-polytope P, where 0 < k < d — 1. Then by 
Theorem 10.4 there are at least d — k facets of C containing F (and F is the 
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intersection of these facets). A d-polytope P with the property that for 
k =0,...,d — 1, the number of facets of P containing any k-face of P equals 
d — k, is called a simple polytope. 

Any simplex is simple, cf. Corollary 12.5, but of course there are many 
other simple polytopes. 

The two notions introduced above are dual: 


Theorem 12.10. Let P and Q be dual d-polytopes. Then P is simple if and only 
if Q is simplicial. 


Proor. Let F and G be proper faces of P and Q, respectively, corresponding 
under some anti-isomorphism from (¥(P), <) onto (F(Q), <). Then 


dimG=d-—1-—dimF, 


cf. Theorem 10.3. Furthermore, by the duality, saying that F is contained in 
j facets of P is equivalent to saying that G contains j vertices of Q. Therefore, 
saying that each k-face of P is contained in precisely d — k facets is equivalent 
to saying that each (d — 1 — k)-face of Q has precisely d — k vertices, 
k =0,...,d — 1, ie. each proper face of Q is a simplex. This proves the 
statement. L 


The following, in a sense, is a dual of Theorem 12.9: 


Theorem 12.11. A d-polytope P is simple if (and only if) each vertex of P is 
contained in precisely d facets. 


Proor. Let Q be a dual of P. If each vertex of P is contained in precisely d 
facets, then each facet of Q has precisely d vertices, cf. Theorem 10.3. There- 
fore, each facet of Q is a simplex, whence Q is simplicial by Theorem 12.9. 
But then P is simple by Theorem 12.10. LJ 


The following characterization of simple polytopes should be compared 
to Theorem 10.5: 


Theorem 12.12. A d-polytope P is simple if and only if each vertex of P is 
incident to precisely d edges of P. 


Proor. Let QO be a dual of P, and let wy be an anti-isomorphism from (¥(P), <) 
onto (F(Q), <). Let x be a vertex of P. Then the number of edges of P 
incident to x equals the number of (d — 2)-faces of the (d — 1)-face W({x}) 
of QO, cf. Theorem 10.3. Therefore, the number of edges incident to a vertex 
of P is d for each vertex of P, if and only if the number of (d — 2)-faces of a 
(d — 1)-face of Q is d for each (d — 1)-face of Q. A (d — 1)-polytope, however, 
has d facets if and only if it is a simplex, cf. Corollary 12.8. The statement then 
follows from Theorems 12.10 and 12.9. L} 
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In Theorem 12.12, note that “incident to precisely d edges” is equivalent 
to “adjacent to precisely d vertices.” 
Here is one more characterization of simple polytopes: 


Theorem 12.13. A d-polytope P is simple if and only if each vertex-figure of P 
is a simplex. 


PROOF. Let Q be a dual of P. Then the facets of Q are duals of the vertex- 


figures of P, cf. Theorem 11.5. The statement then follows from Theorems 
i2.10 and 12.9. LJ 


We shall next establish some properties of simple polytopes that will be 
needed later. 


Theorem 12.14. Let P be a simple d-polytope, and let F,,..., F4-, bed —k 
facets of P, where0 < k < d — 1. Let 


d-—k 
F:= () F,, 
i=1 


and assume that F # @. Then F is a k-face of P, and F,,..., Fy, are the 
only facets of P containing F. 


Proor. Let Q be a dual of P, and let w be an anti-isomorphism from(F(P), <) 
onto (F(Q), <). By its definition, F is the largest face contained in the F,’s, 
whence y(F) is the smallest face containing the wW(F,)’s. It follows from 
Theorem 10.3 that W(F) is a proper face and that the w(F;)’s are vertices of Q. 
Then y(F) is a simplex, cf. Theorem 12.10, and therefore the y(F;)’s must be 
all the vertices of W(F), cf. Theorem 12.2. Since the number of F;’s is d — k, 
we see that 


dim W(F) = (d — k) — 1. 


This implies by duality that the F;’s are all the facets of P containing F, and 
that 


dim F = k, 
cf. Theorem 10.3. LJ 


Theorem 12.15. Let P be a simple d-polytope. Then every proper face of P is 
also simple. 


Proor. Let F be a proper face of P, and let x be a vertex of F. Letting 
k += dim F, we shall prove that there are precisely k facets of F containing x, 
cf. Theorem 12.11. Let Q be a dual of P, and let w be an anti-isomorphism 
from (F(P), <) onto (F(Q), <). Then by Theorem 10.3, the number of 
facets of F containing x equals the number of (d — 1 — (k — 1))-faces of O 
contained in the facet W({x}) of Q and containing the (d — 1 — k)-face 
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WF) of Q. Now, by Theorem 12.10, Q is simplicial, whence y({x}) is a 
(d — 1)-simplex. Therefore, we are seeking the number of (d — k)-faces of a 
(d — 1)-simplex containing a given (d —1-—k)-face of that simplex. 
Corollary 12.3 tells that this number is 


(d—1)—-(d—1—k)\ _ [k _k 
(d—k)-(d-1-k) J 
as desired. LJ 


Theorem 12.16. Let P be a simple d-polytope. Then for0 <j <k <d there 
are precisely 

d—j 

d—k 


k-faces of P containing a given j-face of P. 


Proor. For k = d, there is nothing to prove. For k < d, let Q be a dual of P, 
and let w be an anti-isomorphism from (¥(P), <) onto (F(Q), <). Let F 
be a given j-face of P. Then w(F) is a (d — 1 — j)-face of Q, and the number 
of k-faces of P containing F equals the number of (d — 1 — k)-faces of w(F), 
cf. Theorem 10.3. By Theorem 12.10, w(F) is a simplex. The desired number 


therefore equals 
(d—1—j)+1\_ [d-j 
(d-1—k)4+1) \d—kP 
cf. Corollary 12.4. CJ 


Theorem 12.17. Let P be a simple d-polytope, let x9 be a vertex of P, let 
X1,.-+, X, be certain k vertices of P adjacent to Xo, and let F be the smallest 
face of P containing [Xo, X,],.-.., [Xo, X,]. Then the following holds: 
(a) dim F = k. 
(b) [xo, X,],---, [Xo, X,] are the only edges of F incident to Xo. 
PRroor. Let Q be a dual of P, and let w be an anti-isomorphism from (¥(P), <) 
onto (F(Q), <). Let G:= W(F). Then, by duality, G is the largest face of Q 
contained in the (d — 2)-faces W([x 9, X;]),---, W(LXo, X;]) of the (d — 1)-face 
wW({xo}) of Q, cf. Theorem 10.3. Since Q is simplicial, cf. Theorem 12.10, 
wW({xo}) is a (d — 1)-simplex. Corollary 12.6 then shows that 
dimG=d—1-—k. 

However, by Theorem 10.3 we also have 

dimG =d—1-—dimF, 


whence dim F = k, proving (a). 
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To prove (b), note that by Corollary 12.5 the number of (d — 2)-faces of 
the (d — 1)-face W({x.}) containing G is only k. By duality, this means that 
there are only k edges of F containing x,.. This proves (b). LJ 


Note that once (a) of Theorem 12.17 has been proved, (b) also follows from 
Theorems 12.15 and 12.12. 

In Theorem 12.17, note that F may also be described as the smallest face 
of P containing x9, X,,..., X,, and (b) is equivalent to saying that x,,..., x, 
are the only vertices of F adjacent to xo. 


Theorem 12.18. Let P be a simple d-polytope in R‘, and let x be a vertex of P. 
Let P’ be a d-polytope obtained from P by truncating the vertex x9. Then P’ is 
also a simple d-polytope. Moreover, 


fo(P’) = fo(P) +d —-1 
and 


d 
p= 1,...,d—1. 
1} J 5) ,d 


Proor. Let K denote the closed halfspace in R* such that P’ = K m P, and 
let H denote the bounding hyperplane of K. 

To see that P’ is simple, we shall show that each vertex of P’ is incident to 
precisely d edges of P’, cf. Theorem 12.12. A vertex x of P’ is either a vertex 
of the facet H © P, ora vertex of P not in H, cf. Theorem 11.11(d). If the latter 
holds, then there are precisely d edges of P’ incident to x by Theorems 12.12 
and 11.11(c), (d). If the former holds, then x is the point where a certain edge 
F of P crosses H, cf. Theorem 11.1(d). The edges of P’ incident to x are then 
the edge K 7 F plus the edges of H 4 P incident to x. But H m Pisa(d — 1)- 
simplex by Theorem 12.13, and therefore the number of edges of H m P 
incident to x equals d — 1, cf. Corollary 12.3. 

As already noted, H © P is a (d — 1)-simplex. Therefore, the number of 
j-faces of H © P equals 


d 
, j= 0,...,d—1, 
() ’ 


cf. Corollary 12.4. The expressions for f,(P’), j = 0,...,d — 1, then follows 
by easy applications of Theorem 11.11. LJ 


SAP) = f(P) + ( 


A d-simplex is both simple and simplicial. We conclude this section by 
proving that the converse is also true when d ¥ 2. (The statement is trivially 
true for d = 0, 1. Any 2-polytope is simple and simplicial, but not all 2- 
polytopes are simplices. Therefore, the statement is not true for d = 2.) 


Theorem 12.19. Let d # 2, and let P be a d-polytope which is both simple and 
simplicial. Then P is a simplex. 
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Proor. As noted above, we need only consider d > 3. Let x, be a vertex of P, 
and let x,,..., X4 be the vertices of P adjacent to x9, cf. Theorem 12.12. Let 


S = conv{Xxy, Xys-ee5 Xa} 


Corollary 11.7 implies that S is a d-simplex with vertices x9, x,,..., Xq- 
Let x; and x; be any two of the vertices x,,..., x,, and let F be the smallest 
face of P containing [x,, x,] and [x9, x,]. Then F is a 2-face by Theorem 
12.17(a). Moreover, since P is simplicial, F is a simplex. In other words, F 
is a triangle, and since Xy, x; and x, are vertices of F,, cf. Theorem 5.2, it 
follows that ext F = {x9, x;, x,}. In particular, [x;, x;] is an edge of F, and 
therefore [x;, x,] is also an edge of P, cf. Theorem 5.2. Now, let K be a 
supporting halfspace of S, and let x, be a vertex of S in the bounding hyper- 
plane H of K. Then trivially all the d edges of S incident to x, are in K. But as 
these edges are also edges of P (as we have proved above), and the number of 
edges of P incident to x, equals d by Theorem 12.12, it follows that all edges 
of P incident to x, are in K. Application of Theorem 11.6 next shows that K 
is also a supporting halfspace of P. Hence, every supporting halfspace of S 
also supports P. Since S is the intersection of its supporting halfspaces, cf. 
Theorem 4.5, it follows that P < S. On the other hand, it is clear that S c P, 
whence P = S, showing that P is a simplex. LJ 


EXERCISES 


12.1. Give a direct proof of Theorem 12.2 when X is an e-subset of the (e + 1)-set 
ext S. Apply this result to prove Theorem 12.2 by induction. 


12.2. Show by counting incidences of vertices and edges that we have dfo(P) = 2f,(P) 
for any simple d-polytope P. (This relation is equivalent to the Dehn—Sommerville 
Relation corresponding to i = 1, cf. Theorem 17.1.) 


12.3. Let F be a face of a simple d-polytope P. Show that the facets of F are the faces 
F © Gsuch that Gis a facet of Pwith F 1G 4 @ and F ZG. 


12.4. Let P be an arbitrary d-polytope, and let P’ be a d-polytope obtained by successive 
truncations of all the facets of P. Show that P’ is simple. Verify that f,_-,(P’) = 
f,-,(P) and f(P’) = f,(P) for j = 0,...,d — 2. Show that if some k-face F of P 
is contained in more than d — k facets, then f(P’) > f(P) for j= 0,...,k + 1. 


12.5. A d-polytope P is said to be k-simplicial if each k-face of P is a simplex, and k-simple 

if each (d — 1 — k)-face is contained in precisely k + 1 facets. 

Verify the following: If P and Q are dual, then P is k-simplicial if and only if Q 
is k-simple. Every d-polytope is 0-simplicial, 1-simplicial, 0-simple and 1-simple. 
A d-polytope is simplicial or simple if and only if it is (d — 1)-simplicial or (d — 1)- 
simple, respectively. If P is k-simplicial or k-simple, then P is also h-simplicial or 
h-simple, respectively, for h < k. 

Prove that if a d-polytope P is k,-simplicial and k,-simple with k, + k, => 
d+ 1,then P is a simplex. 
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§13. Cyclic Polytopes 


It is easy to see that the 3-simplices are the only 3-polytopes with the property 
that any two vertices are adjacent. Surprisingly enough, the same statement 
with 3 replaced by any d > 4 is not true; counter-examples are provided by 
polytopes of the type to be introduced in this section. 


For d > 2, the moment curve .@, in R* is the curve parametrized by 
tre x(t):=(t, t?,..., t%), teR. 


This curve has the following interesting property: 


Theorem 13.1. Any hyperplane H in R‘ contains at most d points from M,. 
Proor. Let H = H(y, «), where 


y = (f,,..., Ba). 
Then x(t) € H(y, «) if and only if 
Byt +--+ + Bat? =a. 


By the Fundamental Theorem of Algebra there are at most d values of t 
satisfying this equation, which proves the assertion. O 


Corollary 13.2. Let t,,...,t, be distinct real numbers, where n< d+ 1. 
Then the n-family (x(t,), ..., x(t,)) of points from R‘ is affinely independent. 


Proor. If (x(t,),..., x(t,)) is affinely dependent, then all the points 
x(t,),..., x(t,) belong to some affine subspace A with dim A < n — 2. If 
n<d+ 1,chooset,,,,...,tg+, such that ¢; 4 t;fori,j = 1,...,d + land 
i # j. Then x(t,), ..., X(tgs,) all belong to some affine subspace of dimension 
at most 


(n—2)+(d+1—n=d-1. 


This shows in particular that x(t,),..., x(tg,,) all belong to some hyper- 
plane, contradicting Theorem 13.1. 


By acyclic polytope of type C(n, d), wheren > d + landd > 2,wemeana 
polytope of the form 
P = conv{x(t,),..., X(t,)}, 


where t,,..., ¢, are distinct real numbers. 
Note that a cyclic polytope of type C(d + 1, d) is a d-simplex by Corollary 
13.2. 


Theorem 13.3. Let P = conv{x(t,),..., x(t,)} be a cyclic polytope of type 
C(n, d). Then P is a d-polytope. 
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ProoFr. By Corollary 13.2, any (d + 1)-family formed by distinct points x(t;) 
is affinely independent. Therefore 


aff {x(t,),..., X(ta+)} = R% 
implying that dim P = d. LJ 
Theorem 13.4. Let P = conv{x(t,),..., x(t,)} be a cyclic polytope of type 
C(n, d). Then 
ext P = {x(t,),..., X(t,)}. 


Proor. The inclusion < follows from Theorem 7.2, (a) = (b). Conversely, to 
show that x(t;) is a vertex of P, consider the polynomium p(t) of degree 2 
defined by 


p(t) = —(t — ti)? 
= —t? + 2t,t — t’. 
Let 
y:=(2t;,, —1,0,..., 0) € R*. 
Then 


p(t) = <x(t), y> — 


Since p(t) < 0 for all t € R, and p(t) = 0 if and only if t = t;, it follows that 
K(), t?) is a supporting halfspace of P with 


H(y, t?’) ™ P — {x(t;)}, 
showing that x(t;) € ext P. LJ 


Theorem 13.5. Let P = conv{x(t,),..., X(t,)} be a cyclic polytope of type 
C(n, d). Then P is simplicial. 


ProoF. Since P is a d-polytope, cf. Theorem 13.3, it suffices to show that any 
facet of Pisa(d — 1)-simplex, cf. Theorem 12.9. Let F bea facet of P. Then the 
vertices of F are certain of the vertices of P, say x(t;,), ..., x(t;,), ef. Theorem 
13.4. Then k > d, with k = d if and only if F is a (d — 1)-simplex. Now, note 
that aff F is a hyperplane containing the k points x(t;,), ..., x(¢;,). Theorem 
13.1 then shows that k < d, whence k = d, as desired. L] 


Theorem 13.5 shows that if certain k vertices of a cyclic polytope P form 
the set of vertices of a face of P, then that face must be a (k — 1)-face. In the 
following we shall describe which sets of k vertices of P are the vertex sets of 
faces of P. 

We need some notation. Let 


P = conv{x(t,),..., X(t,)} 
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by a cyclic polytope of type C(n, d), and assume that 
th <-++ <p. 


(This, of course, is no restriction at all.) Let X be a non-empty subset of 
{x(t,),..., X(t,)}. By a component of X we shall mean a non-empty subset Y 
of X of the form 


Y = {x(t;), X(tj+41),.- +5 X(t 1), X(ty)$ 


such that x(t;_,) € X Gfj > 1) and x(t,,,) € X (fk <n). A component Y is 
called a proper component if x(t,) € Y and x(t,) ¢ Y. A component containing 
an even (or odd) number of points is called an even (or odd) component. 

With this notation we can now handle the case k = d; the remaining 
values of k will be treated below. The condition of the theorem is known as 
Gale’s Evenness Condition. 


Theorem 13.6. Let P = conv{x(t,),..., x(t,)} be a cyclic polytope of type 
C(n, d), where t; <--- <t,. Let X be a subset of {x(t,),..., x(t,)} containing 
d points. Then X is the set of vertices of a facet of P if and only if all proper 
components of X are even. 


PrRoorF. Let 
X = {x(t;,),---, X(ti,)}; 


and note that aff X is a hyperplane by Corollary 13.2. Then Theorem 13.1 
Shows that x(t;,),..., x(t;,) are the only vertices of P in aff X. Let 


p(t) = — I] — t;,). 


Then p(t) is a polynomium of degree d, and therefore there are real numbers 
do, Qi; wee Ag (with ag = 1) such that 


P(t) = day tayt +--+ + at’. 


Let 
y = (a1, ee Aq). 
Then 
p(t) = <x(t), y> + do. 
Since 
P(t;,) = +--+ = plt;,) = 9, 
we see that 
x(t;,), a) x(t;,) E Ay, — Ag) 
whence 


A(y, — Ao) = aff X. 
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Now, saying that X is the set of vertices of some facet of P is equivalent to 
saying that there is a supporting hyperplane H of P such that 


HoextP= X. 


But we have seen that H(y, — a ) is the only hyperplane containing X. There- 
fore, X is the set of vertices of a facet of P if and only if H(y, —ao) supports 
P, i.e. 1f and only if all points from the set 


(ext P)\\X —= {x(t,), sey X(ta)}\ LXE) srg x(ti,) 


are on the same side of H(y, — ao). 

Suppose that not all the points from (ext P)\X are on the same side of 
H(y, — ao). Then there are, in fact, x(t;) and x(t,) from (ext P)\X such that 
t; < t,, all points x(t,) with j <1<k are in H(y, —dag), and x(t;) and x(t,) 
are on opposite sides of H(y, — ao). Saying that x(t,) and x(t,) are on opposite 
sides of H(y, —dap) is equivalent to saying that p(t;) and p(t,) have opposite 
signs. Now, p(t) changes sign exactly at the values t¢ = t;,,..., t,,. Therefore, 
there must be an odd number of values t; between t; and t,. In other words, 
the set 


Y = {X(tj41),---5 X(t, 1)$ 


is an odd proper component of X. This proves the “if” statement. 
To prove the “only if” statement, we reverse the argument above. In fact, 
suppose that there is an odd proper component of X, say 


Y= {x(tj4 1), ane) X(t, 1)}. 


Then, by the definition of a proper component, x(t;) and x(t,) are in 
(ext P)\ X. Therefore, p(t;) and p(t,) are #0, and since p(t) changes sign at 
t = tj41,---,¢,-, when t increases from t; to t,, we see that p(t,) and p(t,) 
must have opposite signs, showing that x(t;) and x(t,) are on opposite sides 
of H(y, —do). This completes the proof. 


We next use Theorem 13.6 to treat the remaining values of k. 


Theorem 13.7. Let P = conv{x(t,),..., x(t,)} be a cyclic polytope of type 
C(n, d), where t, <--- <t,. Let X bea subset of {x(t,),..., x(t,)} containing 
k points, where k < d. Then X is the set of vertices of a(k — 1)-face of P if and 
only if the number of odd proper components of X is at most d — k. 


Proor. The set X is the set of vertices of a face of P if and only if there is a 
facet G of P such that 


X cext G. (1) 


In fact, if X = ext F for some face F of P, then by Corollary 9.7 there is a facet 
G of P containing F, whence (1) holds. Conversely, if (1) holds for a certain 
facet G, then X = ext F for some face F of G since G is a simplex, cf. Theorems 
13.5 and 12.2; but then F is also a face of P. 
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Now, by Theorem 13.6, the existence of a facet G such that (1) holds is 
equivalent to the existence of a (d — k)-subset Z of (ext P)\X such that all 
proper components of X U Z are even. This, in turn, is clearly equivalent to 
saying that the number of odd proper components of X is at most d — k, as 
desired. LI 


For small values of k, Theorem 13.7 takes the following form: 


Corollary 13.8. Let P = conv{x(t,),..., x(t,)} be a cyclic polytope of type 
C(n, da), and let k be an integer such that 


1<k<l(|d/2]. 
Then any k of the points x(t,), ..., x(t,) are the vertices of a(k — 1)-face of P. 


Hence, 
n 
fy—1€P) = () 


Proor. As in Theorem 13.7, we assume that t; <---<t,. Whenk < |d/2], 
then d — k > k. Since the number of (odd proper) components of X cannot 
exceed the number of points in X, the conclusion follows immediately from 
Theorem 13.7. LJ 


Corollary 13.8 1s really striking. It shows, for example, that for any d > 4 
there are d-polytopes P with as many vertices as desired such that any two 
vertices of P are adjacent. 

We conclude this section with the following: 


Corollary 13.9. Let P = conv{x(t,),...,x(t,)} and Q = conv{x(s,),..., x(s,)} 
be cyclic polytopes, both of type C(n, d). Then P and Q are equivalent. 


Proor. We may assume that t; <--- <t, and s; <--- < s,. For any face 
F of P with vertices x(t;,),..., x(t;,), define 


P(F) = conv{x(5s;,), -- +5 X(S;,)}- 


Theorem 13.7 then shows that @ is in fact an isomorphism from (¥(P), <) 
onto (F(Q), <). 


EXERCISES 


13.1. Use Theorem 13.6 to show that for any cyclic polytope P of type C(n, d), the 
number of facets of P is given by 


n n — d/2 F 
even; 
n—d2\n—-ay “oO 
Hn O = d + 1)/2 
a("~ + by ) d odd. 
n—d 
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Verify that in both cases, 


n— (d+ 1)/2 — L(d + 2)/2 
yin, d) = (7 Ud + DIAN | (m — Lid + 29/21) 
n—d n—d 
(After reading Section 18, this should be compared to the case j = 0 in Theorem 


18.2.) 


13.2. Show that if P is a cyclic polytope of type C(n, d), where d is even, then each 
vertex-figure of P is equivalent to a cyclic polytope of type C(n — 1,d — 1). 

13.3. Verify that if P is a cyclic polytope of type C(n, d) such that each vertex-figure of P 
is equivalent to a cyclic polytope of type C(n — 1, d — 1), then 

n-u(n — 1,d — 1) =d- p(n, a), 

cf. Exercise 13.1. Use this to show that if d is odd and n > d + 2, then not every 
vertex-figure of P :s equivalent to a cyclic polytope. 

13.4. Give a direct proof of Corollary 13.8 by expanding the idea of the proof of Theorem 
13.4. 


§14. Neighbourly Polytopes 


In Section 13 we met examples of d-polytopes P with the property that for 
certain values of k, every k-subset of ext P is the set of vertices of a face of P. 
In this section we shall study general properties of such polytopes. 


Let k be a positive integer. We shall say that a d-polytope P with at least 
k + 1 vertices is k-neighbourly if every k-subset of ext P is the vertex set 
of a proper face of P, i.e. conv X is a proper face of P for every k-subset 
X of ext P. Ford > 1, every d-polytope is 1-neighbourly and every d-simplex 
is k-neighbourly for all k < d. 

We would like to comment on the condition that P should have at 
least k + 1 vertices. If P has k vertices, then there is only one k-subset of 
ext P, namely, ext P itself; this set, however, is not the vertex set of a proper 
face. If P has fewer than k vertices, then there are no k-subsets of ext P, 
and therefore formally every k-subset of ext P is the vertex set of a proper 
face of P. So, without the condition that P should have at least k + 1 vertices, 
P would be k-neighbourly for all k > card(ext P). 

For k-neighbourliness of a d-polytope P, only k < d 1s possible. In fact, 
ifk >d-+ 1, then (assuming that P has at least k vertices) we can find a k- 
subset X of ext P such that a certain (d + 1)-subset of X forms an affinely 
independent (d + 1)-family; the convex set spanned by X must then have 
dimension d, and therefore it cannot be the vertex set of a proper face. 
Actually, we shall prove below (cf. Corollary 14.4) that except for simplices 
only k < |d/2| is possible. 

Theorem 13.5 and Corollary 13.8 imply: 


Theorem 14.1. Anycyclic polytope P of type C(n, da) is a simplicial k-neighbourly 
polytope for allk < |d/2]. 
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We shall next study k-neighbourly polytopes in general. 


Theorem 14.2. Let P be a k-neighbourly d-polytope, and let 1 <j < k. Then 
P is also j-neighbourly. 


Proor. Let X be any j-subset of ext P. Since 
card((ext P)\X) >d+1—-j 
>(k —sj) +1, 
we see that for any vertex x of P not in X there is a k-subset Y(x) of ext P 
with X c Y(x) and x ¢ Y(X). Let 
F:= () conv Y(x). 


xe (ext P)\X 


Since each conv Y(x) is a face of P, it follows that F is a face of P containing 
X, but not containing any vertex of P not in X. This shows that 


ext F = X, 
whence P is j-neighbourly. = 


Theorem 14.3. Let P be a k-neighbourly d-polytope, and let X be a subset of 
ext P containing at least k + 1 points. Then Q := conv X is also k-neighbourly. 


Proor. Let Y bea k-subset of ext O = X. It follows from the k-neighbourliness 
of P that the set conv Y is a (proper) face of P. Being a proper subset of Q, 
it must then also be a proper face of Q. C 


The next theorem has important implications. 


Theorem 14.4. Let P be a k-neighbourly d-polytope. Then every face F of P 
with 
0 < dim F < 2k -1 


is a simplex. 


Proor. Let j:= dim F. Suppose that F is not a simplex. Then F has at least 
j + 2 vertices. Let M be a (j + 2)-subset of ext F. By Radon’s Theorem, 
Corollary 2.7, there are non-empty complementary subsets M, and M, of M 
such that 


conv M, n conv M, ¥ @. (1) 


At least one of the two sets M, and M, contains at most k points. In fact, if 
both contained more than k points, then we would have 


j+2=cardM,+cardM,>(k+1)+(kK+1)) 
= 2k +2>dimF + 3 
=j+ 3, 
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a contradiction. We may assume that card M, < k. Then by Theorem 14.2 
and the k-neighbourliness of P, the set conv M, is a proper face of P. Let H 
be a supporting hyperplane of P such that 


Ho P=conv M,, 
and let 
yeconv M, n conv M,, 


cf. (1). Then, in particular, y is in H, and therefore at least one vertex of 
conv M, must be in AH. Such a vertex must then be a vertex of conv M,, 
which is contradicted by the fact that M, and M, are disjoint. This completes 
the proof. LJ 


Corollary 14.5. Let P be a k-neighbourly d-polytope, where |.d/2| < k. Then 
P is a simplex. 


PRooF. Since |d/2] < k implies d < 2k — 1, we can apply Theorem 14.4 
with F = P. LJ 


Note that, as a consequence, the only 2-neighbourly and 3-neighbourly 
3-polytopes are the 3-simplices. So, the notion of k-neighbourly d-polytopes 
is only of real interest for d > 4. 


Corollary 14.6. Let P be a (d/2)-neighbourly d-polytope, where d is even. 
Then P is simplicial. 


PRooF. Let F be a facet of P. Then dim F = d — 1 = 2k — 1 withk = d/2. 
Theorem 14.4 next shows that F is a simplex, whence P is simplicial, cf. 
Theorem 12.9. LJ 


Theorem 14.7. A simple d-polytope P is a dual of a k-neighbourly polytope if 
and only if any k facets of P have a non-empty intersection. 


Proor. Let Q be a dual of P. Then Q 1s a d-polytope by Theorem 10.3, and 
QO is simplicial by Theorem 12.10. By the duality, any k vertices of Q belong 
to a proper face of Q if and only if any k facets of P have a non-empty inter- 
section. But since Q is simplicial, then any k vertices of Q belonging to a 
proper face of O are actually the vertices of a proper face. This proves the 
statement. LJ 


Theorem 14.1 and Corollary 14.5 show that except for simplices, no 
polytopes are “more neighbourly” than the cyclic polytopes. In the following, 
| d/2 |-neighbourly d-polytopes will simply be called neighbourly polytopes. 
The duals of such polytopes are called the dual neighbourly polytopes. 

There are neighbourly polytopes other than those equivalent to cyclic 
polytopes. This is trivial for 3-polytopes since every 3-polytope is neighbourly. 
However, higher-dimensional examples are known. 
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Finally, let us remark that for odd d > 3 there are non-simplicial neigh- 
bourly polytopes, cf. Corollary 14.6. In fact, let P be a d-pyramid in R* whose 
basis Q is aneighbourly (d — 1)-polytope. Then P is neighbourly, cf. Theorem 
7.7. On the other hand, if Q is not a simplex, then P is not simplicial. 


EXERCISES 


14.1. Let P be a k-neighbourly d-polytope. Show that each vertex-figure of P is (k — 1)- 
neighbourly. 


14.2. Show that every neighbourly d-polytope is (d — 2)-simplicial, cf. Exercise 12.5. 


§15. The Graph of a Polytope 


The vertices and edges of a polytope P form in an obvious way a non-oriented 
graph which we shall denote by &(P). (For graph-theoretic notions, see 
Appendix 2.) In this section we shall obtain information about connectedness 
properties of YP). The proofs will be based on a technique for turning GP) 
into an oriented graph GP, w) by means of an “admissible” vector w. This 
“oriented graph technique” will also be used in later sections. 


In the following, let P be a d-polytope in R*, where d > 1. A vector we R® 
is said to be admissible for P if <x, w>) # <y, w> for any two vertices x and y 
of P. Geometrically, this means that no hyperplane in R? with w as a normal 
contains more than one vertex of P. 

Concerning the existence of admissible vectors we have: 


Theorem 15.1. For any d-polytope P in R‘, the set of admissible vectors is 
dense in R‘, i.e. for any y € R4 and any ¢ > 0 there is an admissible vector w 
with ||y — wl] < e. 


Proor. We first remark that the union of a finite number of hyperplanes in 
RR‘ has no interior points. This follows by repeated application of the observa- 
tion that for any non-empty open set O in R¢ and any hyperplane H in R‘%, 
the set O\H is again non-empty and open. 

Now, let 


ext P = {x,,..., X;} 
and let 
Ve= {x, — xij = 1,...,k;1 4 jh. 
From the remark above it follows that the open ball 


{ze R*| lly — zl < e} 
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is not contained in the union of the hyperplanes H(v, 0), v € V. In other words, 
there is a w € R* such that ||y — w|| < cand <x; — x;, w> 4 O for i ¥ j. This 
proves the statement. LC] 


Any vector w which is admissible for P induces an orientation of the edges 
of P according to the following rule: An edge [x, y] is oriented towards x 
and away from y if 


(x, WD) > Cy, W). 


The oriented graph thus defined will be denoted by Y(P, w). 

Let w be admissible for P. Calling w the “down direction,” we see that 
the edges of GP, w) are oriented “downwards.” In the following, we shall 
maintain this terminology which enables us to state that one vertex is 
“above” or “below” some other vertex, etc. We can also speak about the 
“top” vertex and the “bottom” vertex of P. 

Given an admissible vector w for P, it is clear that the top vertex of P has 
in-valence 0 in Y(P, w) and that the bottom vertex of P has out-valence 0 in 
G(P, w). We actually have: 


Theorem 15.2. In a graph YP, w), the top vertex is the only vertex of P whose 
in-valance is 0, and the bottom vertex is the only vertex of P whose out-valence 
is QO. 


Proor. Let x be a vertex of P whose in-valence is 0. Then all the vertices of 
P adjacent to x are below x. This implies that there is a hyperplane H with w 
as anormal such that x is above H and all the vertices adjacent to x are below 
H. Using Theorem 11.8 we then see that all vertices of P except x are below 
H, showing that x must be the top vertex. 

The statement about the bottom vertex can be proved in a similar manner, 
or by observing that when w is admissible, then — w is also admissible, and, 
moreover, the in-valence of a vertex x in Y(P, —w) equals the out-valence of 
x in &(P, w). LJ 


Theorem 15.3. Let P be a d-polytope in R*, and let F be a proper face of P. 
Then there is an admissible vector w such that each vertex of F is above each 
vertex of P not in F. 


Proor. Let H(y, «) be a supporting hyperplane of P with H(y, ~) 7 P = F, 
cf. Theorem 7.5. We may assume that <x, y> > a« for all x € P. Let 


yi= min{<x’, y>|x’ e ext P\ext F} 
and let 


O:= max{||x’ — x”|||x’ e ext P\ext F, x” € ext F}-. 
Note that y > a. Take 
é:=(y — a)/20. 
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Now, by Theorem 15.1 there is an admissible vector w such that ||y — w|| < «. 
Let z:= y — w. Then for any x’ € ext P\ext F and any x” € ext F we have 


<x’ _ x", w> _ <x’ _ x", y- z» 
= Cx’, y> ~ <x", y>? ~ «x! ~~ x", Zz» 


> y—a— |x’ — x" Iz 
>y—a- 6e=(y — «)/2 
> 0, 
from where the statement follows immediately. LJ 


Theorem 15.4. Let P be a d-polytope in R‘%, let F be a proper face of P, and let 
M bea subset of ext F. Then there is an admissible vector w for P such that for 
any vertex x of P not in M there is a path in GP) joining x and the bottom 
vertex v of GP, w) without entering M. 


Proor. By Theorem 15.3 there is an admissible vector w such that each vertex 
of F is above each vertex not in F. Let x be any vertex not in M. When x is 
the bottom vertex v, there is nothing to prove. When x # 1, it follows from 
Theorem 15.2 that there is at least one edge going downwards from x. Let 
this edge be [x, x, ]. If x, = v, we have a path from x to v. If x, # v, then 
Theorem 15.2 takes us one step further down. Continuing this way, we obtain 
a “descending” path joinint x and v. It remains to be shown that we can stay 
outside M. Note that once we are outside F, we are below F, and therefore 
we stay outside M from that point on. So, if we can choose the first edge 
of the path in such a manner that the edge is not an edge of F, we have the 
desired conclusion. If x is not in F, this is automatically fulfilled. If x is in F, 
we apply Corollary 11.7 to see that at least one edge of P with x as an end- 
point is not in F. This completes the proof. CI 


Theorem 15.5. Let P be a d-polytope, let F be a proper face of P, and let M be 
a (possibly empty) subset of ext F. Then the subgraph of GP) spanned by 
(ext P)\M is connected. In particular, GY(P) is connected. 


PRooF. Denoting by I the subgraph of Y(P) spanned by (ext P)\M, Theorem 
15.4 shows that there is a vertex v of T such that any vertex of [ can be joined 
to v bya path inT. This implies that any two vertices of I can be joined by a 
path in I via v, showing that I is connected. Taking M = @ shows that 
G(P) is connected. LJ 


Theorem 15.5 showed that G(P) is connected. A much stronger result is the 
following: 


Theorem 15.6. Let P be a d-polytope. Then GP) is d-connected. 
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PROOF. It suffices to show that for any set N of d — 1 vertices of P, the sub- 
graph I spanned by (ext P)\N is connected, cf. Appendix 2, Theorem A2.1. 
If there is a proper face F of P such that N c ext F, then the connectedness 
of I’ follows from Theorem 15.5. If no such face F exists, then—assuming 
that P < R*—any hyperplane containing N must intersect int P. Choose 
such a hyperplane H containing at least one more vertex x, of P. Let K, 
and K, be the two closed halfspaces bounded by H, and let P, := K, A P 
and P,:= K, © P. Then P, and P, are d-polytopes, cf. Theorem 9.2. More- 
over, the set F := H ~ P is a facet of both. The vertices of F are the vertices 
of P in H and the 1-point intersections of H and edges of P crossing H, cf. 
Theorem 11.1(b), (d). Let 


M:=N wu (ext F\ext P). 


Let I, denote the subgraph of &(P,) spanned by (ext P,)\M, and let T, 
denote the subgraph of Y(P,) spanned by (ext P,)\M. Theorem 15.5 states 
that ’, and I’, are connected. Now, let x be any vertex of P not in N. Then 
x is a vertex of [, or I’, (or both); assume that x is a vertex of [,. Then by the 
connectedness of I’, there is a path in I’, joining x and x,. This is a path in 
G(P) not entering N. Hence, any two vertices of P not in N can be joined by a 
path in GP) not entering N via the vertex x,, showing that the subgraph 
of YP) spanned by (ext P)\N is connected. LJ 


By a facet system in a polytope P we mean a non-empty set Y of facets of 
P. Each &(F), F € ¥, is then a subgraph of GP). The union of the subgraphs 
GF), Fe FS, is denoted by ¥(S). We shall say that a facet system ¥ is 
connected if GF) is a connected graph. 


Theorem 15.7. Let S be a connected facet system in a simple d-polytope P, 
where d > 2. Then GS) is a (d — 1)-connected graph. 


Proor. We prove the statement by induction on the number n of members 
of S. For n = 1, the statement follows immediately from Theorem 15.6. 
For n > 2, we may number the members F,,..., F,, of Y in such a manner 
that the subsystem 4’ formed by F,,..., F,,_, 1s also connected. (Take F, 
arbitrary, use the connectedness of Y to find F, such that {F,, F,} is 
connected, use the connectedness of Y to find F, such that {F,, F,, F3} is 
connected, etc.). Then use the induction hypothesis to deduce that GY’) is 
(d — 1)-connected. Now, by the connectedness of / there is an Ff; with 
j<n—1such that F; F, 4 ©. It then follows from Theorem 12.14 that 
F,OF, 1s a (d — 2)-face of P. Therefore, F; and F,, have at least d — 1 
vertices in common. Hence, the graphs G(.%’) and G(F,,) have at least d — 1 
vertices in common. Since both graphs are (d — 1)-connected, it follows that 
their union, i.e. YS), is (d — 1)-connected, cf. Appendix 2, Theorem A2.2. 
L] 
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EXERCISES 


15.1. A semi-shelling of a d-polytope P is a numbering F,, F,,..., F,, of the facets of P 
such that for i = 2,...,k, the set 


i-1 
F.0 UF; (1) 
j=l 
is anon-empty union of (d — 2)-faces of F;. Show by a duality argument that every 
d-polytope admits semi-shellings. 
Verify that any facet can be taken as F,. Verify that the facets containing a given 
face can be taken to precede all the remaining facets. 
(A semi-shelling is a shelling if, in addition, for i = 2,...,k — 1 the set (1) is 
homeomorphic to a (d — 2)-ball. If P is simplicial, then this condition is auto- 
matically fulfilled.) 


15.2. A graph is said to be planar if, loosely speaking, it can be drawn in the plane with 
non-intersecting (not necessarily rectilinear) edges. Show that the graph GP) of 
any 3-polytope P is planar. (Along with Theorem 15.6, this proves the easy part of 
Steinitz’s Theorem: A graph I is (isomorphic to) the graph &(P) of a 3-polytope P 
if and only if it is planar and 3-connected.) 


CHAPTER 3 
Combinatorial Theory 
of Convex Polytopes 


§16. Euler’s Relation 


At the beginning of Section 10 it was indicated that the combinatorial theory 
of convex polytopes may be described as the study of their face-lattices. When 
it comes to reality, however, this description is too ambitious. Instead, we 
shall describe the combinatorial theory as the study of f-vectors. For d > 1, 
the f-vector of a d-polytope P is the d-tuple 


FOP) = (folP)s fi(P), «+» fa-1(P))- 


where f,(P) denotes the number of j-faces of P, cf. Section 10. Equivalent 
polytopes have the same f-vector, but the converse is not true in general. 

It may be said that the basic problem is as follows: Which d-tuples of 
positive integers are the f-vectors of d-polytopes? Denoting by Y the set of 
all d-polytopes and by f (#") the set of all f-vectors of d-polytopes, the problem 
amounts to determining the subset f(Y") of R*. This problem has only been 
solved completely ford < 3, the cases d = 1, 2 being trivial. 

In this section we shall determine the affine hull aff f(#*) of the set f(P"). 
This partial solution to the basic problem 1s a main general result in the area. 


We first prove that there is a linear relation which 1s satisfied by the 
numbers f,(P), j = 0,...,d — 1, for any d-polytope P. For technical reasons, 
we prefer to include the numbers f_ ,(P) = 1 and f,(P) = 1. The relation is 


known as Euler’s Relation: 


Theorem 16.1. For any d-polytope P one has 


XY CDF) = 0 
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Note that Euler’s Relation may also be written as 
: d . 
» (-1)F(P) = 1, 
j=0 
or 


d—1 
» (- DFP) = 1 — (- 
j=0 
=1+(-1)*". 
Since f(P) = 0 when j > d = dim P, we may also write 


» (-1)f(P) = 9, 
j2-1 
thus avoiding reference to the dimension of P. 
PROoF. We use induction on d. For d = 0, | there is nothing to prove and 
ford = 2 the statement is obvious. So, let d be at least 3, assume that the state- 
ment is valid for all polytopes of dimension < d — 1, and let P be a d- 
polytope. Assuming that P < R*%, we choose an admissible vector w for P, 
cf. Theorem 15.1. Let x,,..., x, be the vertices of P, numbered such that 
CX;,W> < (X41, WD, i=1l,....n—1. 
Calling w the down direction as we did in Section 15, this means that x;,, is 
below x;. Let 
Xo;-1,¢°> <X;, Ww, i= lL... 57. 
Noting that «,;_, < o;4,, we next choose a,; such that 
X2i-1 S %25 < M41, i=l,....n—1, 
and define 
H,,:= H(v, a), k=1,...,2n —1. 
Then the H,’s form a collection of parallel hyperplanes with H,,, below H, 
such that the H,’s with odd values of k pass through the vertices of P. Let 
P,:= A, 0 P, k= 1,...,2n — 1. 


Then P, is a (d — 1)-polytope for k = 2, ..., 2n — 2, whereas P, = {x,} 
and P,,_, = {x,}. By the induction hypothesis, Euler’s Relation is valid for 
the polytopes P,, whence 


2X CDP d = 0. k=1,...,2n—1. 
j2- 
Multiplying by (— 1)**' and adding, we get 


2n-1 


XC YH DiPy) = 0. 


jz7- 
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which we may rewrite as 


2n—1 
LCD" LY (viylPe = 0. (1) 
We shall prove that 
2n-1 —1, j= -1; 
» (- If (P)) = fr —fo(P), j = 9; (2) 
eet Sis (P), j=l,...,d—1. 


Combining (1) and (2), we obtain the desired relation. 

To prove (2) for j = —1, note that f_ ,(P,) = 1 for all k, whence the left- 
hand side is an alternating sum of the form —1 + 1 —--- + 1 — 1, whichhas 
the value —1. 

To prove (2) for j > 0, we define for Fe A(P) and k = 1,...,2n — 1, 


WE y= 4, van E # ©; 
fH,arnF = ©. 
Furthermore, we denote by ¥(P) the set of j-faces of P. 

Let us first consider the case 1 <j < d — 1. Here a j-face of P has at 
least two vertices, and since each H, contains at most one vertex of P, it 
follows that no j-face of P, is a face of P. Theorem 11.1(d) then shows that 
for each j-face F’ of P, there is a unique face F of P such that F’ = H, 2 F, 
and this face F is a (j + 1)-face. Under these circumstances, it is clear that 
AH, ari F 4 &. Conversely, if F is a (j + 1)-face of P with H, Ori F # 
©, then F’:= H, - F is a j-face of P,, cf. Theorem 11.1(b). In conclusion, 
for fixed j and k, the mapping 


Fie F’:=H,OF 


iS a one-to-one mapping from the set of (j + 1)-faces F of P with W(F,k) = 
1 onto the set of j-faces of P,. Therefore, 


f{(P)= d WF.k). — Q) 


FeF ;+1(P) 


Having established (3), we may rewrite the left-hand side of (2): 


2n—1 2n-—1 
x (pe) = Y(-DE YO WERK 
k=1 k=1 Fe Fj+4(P) 
= YY (vWF, bk). (4) 


FeFj4i(P) k=1 


Now, let us consider a fixed (j + 1)-face F of P. Let x;, be the top vertex of F, 
and let x;, be the bottom vertex of F. Then the values of k such that W(F,k) = 1 
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are the values k = 2i,, 2i, + 1,..., 2i, — 3, 2i, — 2. Here the number of 
even values is | larger than the number of odd values, whence 
2n—-1 
» (-1)WeF, k) = 1. (5) 
k=1 
Combining (4) and (5) we then get 
2n-1 
~(-OF(PdI= Dd 1 
k=1 FeF ;+1(P) 
= fis i(P), 


proving (2) for! <j<d-—1. 

The case j = 0 requires a little more care. For k even, H, contains no 
vertex of P. It then follows that for k even, no 0-face of P, is a face of P. We 
can then argue as above and we obtain 

fo(P.) = > WER), — keven. (6) 
Fe F,(P) 
For k odd, the situation is slightly different. In this case, H, contains one 
vertex of P which is then also a vertex of P,. For the remaining vertices of P, 
we can next argue as above, thus obtaining 


f(P) = > WF, +1, k odd. (7) 
Fe F,(P) 
Using (6) and (7), the left-hand side of (2) may be rewritten as 
2n-1 2n-1 2n-1 
» (-I*f(P) = Y(-D' Vo WEA + Db (-1 
k=1 k=1 FeF¥(P) k=1 
k odd 
2n-1 

= YY CVE, ’) — folP), (8) 


Fe F,(P) k=1 


where we have used the fact that the number of odd values of k equals fo(P). 
We next argue as in the case 1 < j < d — 1 to obtain 


2n-1 


DDE, KI (9) 


for any F € ¥ ,(P). Combining (8) and (9) we then get 


2n-—1 


» (—Dfo(P.) = d 1-folP) 


FeF,(P) 
= f(P) — fo(P), 
proving (2) for j = 0. This completes the proof. LJ 
Now, for d > 1 let 
e:= (1, —1,...,(—1)* Je R4 
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Then H(e, 1 — (—1)*) is a hyperplane in R* which we shall call the Euler 
Hyperplane. Note that o € H(e, 1 — (—1)*) if and only if d is even. Theorem 
16.1 shows that f(P*) is contained in the Euler Hyperplane. We shall prove: 


Theorem 16.2. The Euler Hyperplane H(e, 1 — (—1)*) is the only hyperplane 
in R¢ which contains f (P"). 


PRoor. We use induction on d. For d = 1, 2, the statement is obvious. So, 
let d be at least 3 and assume that the statement holds for all dimensions 
<d— 1.Let H = H(y,«) be any hyperplane in R¢ such that f(A) < H(y, «). 
We shall prove that H(y, «) = H(e, 1 — (—1)*) by showing that there is a real 
c # Osuch that y = ce and « = c(1 — (—1)”). 

Let Q be any (d — 1)-polytope, and let Q’ be an equivalent of Q in R*. 
Let P, be a d-pyramid in R* with Q’ as a basis, and let P, be a d-bipyramid 
with Q’ as a basis. We can express f(P,) and f(P,) in terms of f(Q’) = f(Q). 
In fact, it follows from Theorem 7.7 that 


F(P1) = (fo(Q) + 1,f1(Q) + fo(Q), ---,fa-2(Q) + fa-3(Q), 1 + fa-2(Q)), 


and it follows from Theorem 7.8 that 


F(P2) = (folQ) + 2, f1(Q) + 2fo(Q), ---,fa-2(Q) + 2fa-3(Q), 2fa-2(Q)). 
Now, writing 
V = (01,---, Oa) 


we have 


ay(folQ) + 1) + a2 fi(Q) + fo(Q)) + --- 
s+ tg 1(fa-2(Q) + fa-3(Q)) + Hall + fa-2(Q)) = & (10) 


since f(P,)€ H(y, «), and we have 


%1( fo(Q) + 2) + %2(f1(Q) + 2fo(Q)) + °°: 
+ Og 4(fy—2(O) + 2fg-3(Q)) + %2fg_-2(Q) =a (11) 


since f(P,) € H(y, «). Subtraction of (10) from (11) yields 
&1 + %2 fo(Q) +--+ %a~1fa-3(Q) + 4a fa- 22) — 1) = 9. (12) 
We reqrite (12) as 
2 fo(Q) + %3 fi(Q) +--+ + %-1 fa-3(Q) + %a fa-2(Q) = Ma — %- C13) 
Letting 
Zi= (G,..., &q), 
it follows from (13) that 
f(O)€ H(z, %g — &;). (14) 
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Since y # o by assumption, we have a; 4 0 for at least one j = 1,..., d. But 
then also a, # 0 for at least one j = 2,...,d, implying that H(z, a, — «,)isa 
hyperplane in R*~*. As Q is arbitrary, we see that H(z, «, — «,) is a hyper- 
plane in R*~* containing f(P*~ *), cf. (14). By the induction hypothesis, this 
implies that 


(4,...,%) = yl, —1,...,(—1)*-”) 


and 

%q — oy = y(1 — (-1)*"*) 
for a suitable y 4 0. Taking c:= —y, we then see that y= ce and «= 
c(1 — (—1)), as desired. L) 


An immediate consequence of Theorems 16.1 and 16.2 is the following 
main result: 


Corollary 16.3. The affine hull aff f(P*) of f(P*) is the Euler Hyperplane 
H(e, 1 — (—1)9. 


We conclude this section with a variant of Euler’s Relation. For faces 
F, and F, of a d-polytope P with F, < F, we shall write f{F,/F,) for the 
number of j-faces F of P such that F, c F c F,. Note that f(F,/F,) = 0 
for —1 < j < dim F, and for dim F, < j. We shall establish a linear relation 
between the numbers f({F,/F,) when F, & F3. 


Theorem 16.4. Let F, and F, be faces of a polytope P with F, | F,. Then 
» (-1)f(F2/F1) = 0. 


j2-1 
PROOF. We know by Theorem 11.4 that there is a polytope Q with 
dim Q = dim F, — 1 — dim F, 


such that the lattice (¥(F ,/F ,), < )is isomorphic to the face-lattice (F (Q), <), 
and, moreover, 


dim G = dim F — 1 — dim F, 
when F, c F c F, and Gis the face of QO corresponding to F. In particular, 
SAF 2/F 1) = fi€Q) (15) 
when 
k=j—1-—dim F,. 
Euler’s Relation for Q may be written as 


Y (- DQ) = 0. 


k>-1 
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Using (15), we next obtain 
Y (HI aim PF ,/F 1) = 0, 
j2-1 


which is clearly equivalent to the desired relation. LJ 


Of course, in Theorem 16.4 we recover Euler’s Relation by taking F; = @ 
and F, = P. 


§17. The Dehn—Sommerville Relations 


In the preceding section we showed that the affine hull of f(A“) has dimension 
d — 1. Denoting by #4 the set of all simple d-polytopes, and by f(#4) the 
set of all f-vectors of simple d-polytopes, we shall prove that the dimension 
of aff f(P%) is only |d/2]. Moreover, we shall find “representations” of 
aff f (F>). 

(There is no standard notation for the set of stmple d-polytopes. Our 
notation # is inspired by the standard notation Y4 for the set of simplicial 
d-polytopes.) 


We first exhibit a set of linear relations which are satisfied by the f-vectors 
of all simple d-polytopes. These relations are known as the Dehn—Sommerville 
Relations: 

Theorem 17.1. For any simple d-polytope P we have 
Yep, A )AP) =f0) 
j=0 — i 
fori =0,...,d. 
Note that, effectively, we only sum from j = 0 toj = i. For i = d we get 


Euler’s Relation. For i = 0 we get the trivial relation fo(P) = fo(P). 
For d < 2, everything is trivial. For d = 3, the relations are 


So(P) = fo(P), 
3fo(P) — fi(P) = f\(P), 
3fo(P) — 2f:(P) + fo(P) = f2(P), 
fo(P) -— f,(P)+f(P) -—1=1. 


These four relations are equivalent to the following two: 
3fo(P) — 2f,(P) = 0, 
fo(P) — fiCP) + f(P) = 2. 
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It is interesting to note that if one of the three numbers f,(P), /;(P), and f,(P) 
is known, then the remaining two are determined by the Dehn-Sommerville 
Relations. In particular, we can express f)(P) and f,(P) by f,(P): 


fo(P) = 2f(P)— 4, — fi(P) = 3f,(P) — 6. 


(See Theorem 17.6 and Corollary 17.7 below for a d-dimensional version.) 


PRrooFr. For any non-empty face F of P, Euler’s Relation states that 


> (- DAF) = 0. (1) 

j2-1 
Using the notation ¥,(P) for the set of i-faces of P as we did in the proof of 
Euler’s Relation, it follows immediately from (1) that fori = 0,...,d we have 


» » (-1ffF) = 0, 


FeF,(P) j>—-1 


or, equivalently, 


Y (-1) & f(F)=0. (2) 
j2-1 Fe F,(P) 
The value of the sum 
» SAF) 
Fe F,(P) 


is the number of pairs (G, F) of faces of P such that dim G = j, dim F = i 
and G c F. Therefore, 


> {= > fiP/o). (3) 


Fe F;(P) Ge F ,(P) 


For dim G = j > 0, the number f,(P/G) was determined in Theorem 12.16; 
in fact, we showed that 


fP/6) = (4 ~ 1) (4 


Note that fori < j < d, both sides of (4) are 0, and so (4) is valid for0 <j < d. 
For dim G = j = —1, it is clear that 

f(P/G) = fP). (5) 
Combining now (2), (3), (4), and (5), we get the desired relation. CJ 


As mentioned in the beginning, we aim to show that the dimension of 
aff f(P%) is Ld/2]. It will follow from Theorem 17.1 that the dimension is at 
most | d/2 |. To see that it is at least |d/2], we need the lemma below. To ease 
the notation, we shall write 


m:=|(d — 1)/2], n:=|d/2]. 
Note thatd = m+n 4+ 1. 
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Lemma 17.2. For i = 0,...,, let P; be a cyclic polytope of type C(p + i, d) 
for some fixed p >d+ 1. Then the f-vectors f(P;), i= 0, ..., n, form an 
affinely independent family in R‘. 


PROOF. The f-vector of P; has the form 


, 


f(P) = ((’ , ) Lo (’ iy ) lo. A fe-1)) 


nl 


cf. Theorem 13.5 and Corollary 13.8. Saying that the f(P,)’s are affinely 
independent is equivalent to saying that the f(P,)’s defined by 


{(P) = t (’ , } ses (’ ).n(P. ve t-s0) 


are linearly independent. In terms of matrices, this is equivalent to saying 
that the (n + 1) x (d + 1) matrix whose rows are formed by the f(P;)’s has 
rank n + 1. Consider the submatrix A formed by the first n + 1 columns, i.e. 


Ais (’ r ) | 
J i=0,....n; j=0,...,n 


We shall complete the proof by showing that A is invertible. Let 


and let 


Then 


_— y pt+i\(—-pt+j\_ fit 
NO Soh kN nak} Van PP 
cf. Appendix 3, (7). Hence, C has only 1’s in the “skew diagonal” (i.e. the 
positions (i, j) with i + 7 = n) and only 0’s above. Therefore, 
|det C| = 1, 

implying that A is invertible. LJ 

It 1s easy to prove by induction on n that the matrix A in the proof above 
has determinant 1. Hence, one can prove that A is invertible without referring 


to Appendix 3, (7), if desired. 
We shall next prove: 


Theorem 17.3. The affine hull aff f(A‘) of f (A4) has dimension | d/2]. 


PROOF. Consider the system of d + 1 (homogeneous) linear equations 


d _ * 
(4 ‘hs =x, i=0,...,d, (6) 
=0 —_ 


J 


§17. The Dehn-Sommerville Relations 107 


with unknowns Xo,..., Xy_-1, Xz. Assigning the value 1 to xz we obtain a 
system of d + 1 linear equations with d unknowns Xo,..., X;—,. (Note that 
for d odd, the equation corresponding to i = d is inhomogeneous.) Theorem 
17.1 tells that for each polytope Pe P4, 


(Xo.--+5Xa-1) = (folP), -- +, fa-1€P)) 


is a solution. In other words, denoting the set of solutions by S, we have 
f (P4) < S, whence 


aff f(P4) < affS = S, 
and so 
dim(aff f(Pg)) < dim S. 

Now, it 1s easy to see that the equations (6) corresponding to odd values of i 
are independent. Since the number of odd values of i is m + 1, it follows that 
dim S <d — (m+ 1) 

=n, 
implying that 
dim(aff f(A4)) < n. 


To prove the converse, we apply Lemma 17.2. Let P; be as described there, 
and let Q; be a dual of P;,i = 0,...,n. Then f(Q;) € f(A4) by Theorems 13.5 
and 12.10. Moreover, since the f(P;)’s are affinely independent, cf. Lemma 
17.2, it follows that the f(Q;)’s are affinely independent. The number of 
f(Q,)’s is n + 1, and therefore the dimension of aff f(#4) is at least n. This 


completes the proof. CI 
For i = 0,..., d, let H; denote the set of points (xo, ..., Xg_-1) € R? such 
that 


d d ~j 
_1) =x, 


where, as in the proof above, it is understood that x, = 1. Then Hy = R¥%, 
and for i > 1, each H; is a hyperplane in R*. During the proof above, we 
showed that 


d d 
n= dim ()\ H; => dim ()\H; > dim(aff f(#4)) => n. 
i=0 i=0 

iodd 


We also have 


d 


d 

aff f(Pg) < (\H; c (\ Hi, 

i=0 i=0 
iodd 
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and we can therefore conclude that 


d d 
aff f(A4) = (\H; = () Hj. 
i=0O i=0 
iodd 


Thus, we have obtained “representations” of aff f(A4) as intersections of 
families of hyperplanes. (Since Hy = R%, this set can be omitted.) Actually, 
the “representation” of aff f(A‘) as the intersection of the odd-numbered 
Hs is “minimal” in the sense that it includes the smallest possible number of 
hyperplanes. In the following, we shall establish other such “representations.” 
We shall, however, prefer to formulate the results in terms of linear equations, 
rather than using a geometric terminology. 

A system of linear equations with d unknowns xy,..., Xg—, will be called 
a Dehn—Sommerville System for the simple d-polytopes if its set of solutions 1s 
precisely aff f(A4). A Dehn-Sommerville System containing a minimal 
number of equations is said to be minimal. It follows from Theorem 17.3 that 
this minimal number is 


d —|d/2] = |(d + 1)/21. 


Any subsystem of a Dehn-Sommerville System which is formed by 
|(d + 1)/2] independent equations is necessarily again a Dehn~Sommerville 
System (and hence minimal). 

It follows immediately from the remarks above that we have: 


Theorem 17.4. The equations 
d [d—j 
ery Thay = x i=0,...,d, 


where x4 = 1, forma Dehn—Sommerville System. The equations corresponding 
to odd values of i form a minimal Dehn—Sommerville System. 


In dealing with Dehn-Sommerville Systems it is convenient to use matrix 
notation. We shall write 


where it is always understood that x; = 1. Ifwelet A bethe(d + 1) x (d + 1) 


matrix defined by 
A:= [c »(( 5 /) 
d—i i=0O,...,d;j=0,...,d 


then we may write the Dehn—Sommerville System of Theorem 17.4 as 


Ax =X. (7) 
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Now, once we know this Dehn—Sommerville System, it is easy to produce 
new Ones: any system obtained from (7) by multiplying from the left on both 
sides by an invertible matrix will again be a Dehn-Sommerville System. We 
shall apply this procedure below. 


Theorem 17.5. The equations 


Fens = Seve! ju tO. 


where xq = 1, forma Dehn-Sommerville System. The equations corresponding 
to the values i = 0,...,m form a minimal Dehn—Sommerville System. 


ProoF. Let B be the (d + 1) x (d + 1) matrix defined by 


Bix (-»~()) | 
UL] /i=0,...,d;j=0,...,d 


Note that B is invertible, cf. Appendix 3, (11). Since x = Ax is a Dehn- 
Sommerville System, it follows that 


Bx = BAx (8) 


is also a Dehn-Sommerville System. Except for the factor (—1)', the ith 
entry on the left-hand side of (8) is the left-hand side of the ith equation in the 
theorem. In order to evaluate the right-hand side of (8), we first calculate the 
element of BA 1n the ith row and jth column. This element is 


: i+n( K j{4-J _ 7 k d-j 
2 wD (i) ae eee Seo iy) 
— (_.1\' +s 1d J 
= (-1'*% (7) 


where we have used Appendix 3, (12). Then the ith entry on the right-hand 
side of (8) becomes 


d 


it jt J _ ( _ _ +j J | 
LCV (1 or IY (- pF (1) 


which — except for the factor (— 1)'—is the right-hand side of the ith equa- 
tion. This completes the proof of the first statement. 

To see that the first m + 1 equations form a minimal Dehn-Sommerville 
System, if suffices to show that they are independent. We have proved above 
that—except for the factor (—1)'—the equations of the theorem may be 
written as 


Bx = BAx. 
We rewrite this as 
(B — BA)x = 0, 
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where 0 on the right-hand side denotes the (d + 1) x 1 zero matrix. Now, 
note that the calculation above shows that the element of BA in the ith row 
and jth column is 0 for i, j < m. In other words, the (m + 1) x (m + 1) 
submatrix of B — BA in the upper-left corner is the matrix 


Bo = (1 
l i=0,...,m;j=0,...,m 


This matrix, however, is invertible, cf. Appendix 3, (11), and therefore the 
first m + 1 equations are independent. LJ 


Theorem 17.6. The equations 


- (m+1+j\({m-—i+j 
xi = Yo j )( Jimi 
j=0 


m—i 
+ —rScn(Z coi)(" 7 i) ee i= 0,0... 


where x4 = 1, form a minimal Dehn—-Sommerville System. 


PROOF. Let B, be the (m + 1) x (d + 1) matrix defined by 


B, = 0} 
L] /i=0,...,m;j=0,...,d 


and let C be the (m + 1) x (d + 1) matrix defined by 


C:= ern J . 
d—i i=0,...,.m;j=0,...,d 


Then the minimal Dehn-Sommerville System of Theorem 17.5 may be 
written as 


By,x = Cx. (9) 


Let By and B, be the submatrices of B, formed by the first m + 1 and the 
last (d+ 1)-— (m+ 1)=n+1 columns of B,, respectively. (Then B, 
denotes the same matrix as in the proof of Theorem 17.6.) In a similar way, 
let C. and C, denote the submatrices of C formed by the first m + 1 and the 
last n + 1 columns of C, respectively. Then we may rewrite (9) as 


Xo Xm+1 
(Bo _ Co) — (C, — B,) ; . (10) 
x Xq 


m 


Now, note that Co is a zero matrix, whence By — Cy = Bo. Let 


Do i= (/ | 
LU] J i=O0,...,m;j=0,...,m 
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Then D, is in fact the inverse of By, cf. Appendix 3, (11). Therefore, multiplica- 
tion by Dp in (10) gives the new minimal Dehn-Sommerville System 


Xo Xm+1 
= D(C, _ B,) 
Xm Xa 


To see that this is the system in the theorem, we calculate the elements of 
D(C, — B,). Note first that 


C, —B,= 


cen (™ + I *) 4 anen(™ + 1 *) 
d—i I i=0,...,m;j=0,...,n 


Then the element of D,(C, — B,) in the ith row and jth column is 


. k ntkej{M+1 +] mekej(M+i 4+] 
S (core (eer (2) 
Seon" *) +. y coren()(™ 7 .) 

j(mrit+i m—-—i+j . n+kej(K m+1+j 
of NEL « Serre ste) 


cf. Appendix 3, (10). From this the statement follows immediately. LJ 


We note an interesting corollary of Theorem 17.6: 


Corollary 17.7. Let P be a simple d-polytope. Then the numbers fo(P),...,fi(P) 
are determined uniquely by the numbers f,,4,(P), ...5fa—1(P). 


Of course, all the preceding results have dual counterparts for simplicial 
polytopes. We only mention the dual of Theorem 17.1, the Dehn-Sommer- 
ville Relations for the simplicial d-polytopes: 


Corollary 17.8. For any simplicial d-polytope P we have 


d-1 d-1-j J+) _ 
» (-1) ()? 1) 14P) = 40 


jen 


fori= —-1,...,d-1 


For an entirely different approach to the Dehn—Sommerville Relations, 
see the remark at the end of Section 18. 


112 3. Combinatorial Theory of Convex Polytopes 


$18. The Upper Bound Theorem 


In this section we shall answer the following question: What is the largest 
number of vertices, edges, etc. of a simple d-polytope, d > 3, with a given 
number of facets? Moreover, we shall find out which polytopes have the 
largest number of vertices, edges, etc. The result which is known as the Upper 
Bound Theorem is a main achievement in the modern theory of convex 
polytopes; it was proved by McMullen in 1970. 


Let us begin by noting that all simple 3-polytopes with a given number of 
facets have the same number of vertices and the same number of edges: 
this follows from the “reformulation” of the Dehn-Sommerville Relations 
mentioned at the beginning of Section 17: 


fo(P) = 2f,(P) — 4, f,(P) = 3f.(P) — 6. (1) 


So, the following is only of significance for d > 4. 
Recall from Section 14 that for any simplicial neighbourly d-polytope 
P with p vertices we have 


ey~(_" jahene 


(As in Section 17, we write m:=[(d — 1)/2] and n:=|[d/2].) Moreover, for 
these values of j, no simplicial d-polytope with p vertices can have a larger 
number of j-faces. So, for 1 < j < n — 1, the least upper bound for the num- 
ber of j-faces of simplicial d-polytopes with p vertices equals 


(1) 


and this upper bound is attained by the neighbourly polytopes. Conversely, 
if P is a simplicial non-neighbourly d-polytope with p vertices, then 


p 
er<(2 | 


for j = n — 1, and possibly also for smaller values of j. 
In the dual setting, the discussion above shows that form + 1 <j<d-— 2, 
the least upper bound for the number of j-faces of a simple d-polytope with 


p facets equals 
P 
d—j} 


and that this upper bound is attained by the dual neighbourly polytopes; 
moreover, if P is a simple d-polytope with p facets which is not dual neigh- 


bourly, then 
p 
wr<(." 
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for j = m + 1, and possibly also for larger values of j. The main result of 
this section includes these statements. 
In order to state the main result, we define for j > 0 


®(d, p= ¥ (ees 5 (Ure) 2) 
i=o \VJ l i=0 J l 


With this notation the Upper Bound Theorem may be stated as follows: 


Theorem 18.1. For any simple d-polytope P with p facets we have 
f{P) < Ofd,p), f= 0,...,d-2 

If P is dual neighbourly, then 
FP) = Od, p), j=0,...,d—2. 

If P is not dual neighbourly, then 
f(P) < Dd, p), j=0O0,...,m4+ 1, 

(and possibly also for larger values of j). 


It is easy to verify that Theorem 18.1 holds for d = 3. Recall that any 
3-polytope is neighbourly, and therefore any 3-polytope 1s also dual neigh- 
bourly. So, for d = 3 the statement of the theorem amounts to saying that 
for any simple 3-polytope P with p facets we have f>(P) == ®,(3, p) and 
fi(P) = ®,(3, p). Noting that ®,(3, p) = 2p — 4 and ®,(3, p) = 3p — 6, this 
follows immediately from (1). 

Since ®,_,(d, p) = pand ®,(d, p) = 1, the first two statements of Theorem 
18.1 also hold for j = d — 1 and j = d. The proof below actually covers 
these values of /. 

The discussion preceding Theorem 18.1 shows that we must have 


oin=(," ) jom+l,...,d—-2. 


We shall return to this matter after the proof of Theorem 18.1. 

Let us also remark that Theorem 18.1 shows that the dual neighbourly 
polytopes are remarkably well equipped with faces: Among all simple 
d-polytopes with p facets, any dual neighbourly has the largest possible 
number of j-faces for all values of j between 0 and d — 2. 

Finally, let us remark that the upper bound inequality f(P) < ®(d, p) 
actually holds for any (i.e. not necessarily simple) d-polytope P with p 
facets; this is due to the fact that for any d-polytope P there is a simple d- 
polytope P’ with the same number of facets as P and as least as many j-faces 
forO0 <j <d — 2, cf. Exercise 12.4. 


PROOF. The proof is divided into three parts. In Part A we shall introduce 
certain numbers g,(P) associated with a simple d-polytope P. In Part B we 
shall obtain relations between the numbers g,(P) and the corresponding 
numbers g,(F) for facets F of P. Finally, in Part C we shall combine the results 
of Part A and Part B to obtain the desired conclusions. 
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A. In the following, let P be a simple d-polytope in R’, and let w be any 
vector in R* which is admissible for P, cf. Theorem 15.1. As described in 
Section 15, the vector w turns the non-oriented graph G(P) into an oriented 
graph &(P, w). (For graph-theoretic notions, see Appendix 2.) The following 
is an immediate consequence of Theorem 12.12: 


(a) For each vertex x of P, the sum of the in-valence of x and the out-valence 
of x equals d. 


We shall need some more definitions. A k-star,k = 0,...,d, is a set formed 
by a vertex x of P and k edges of P incident to x; the vertex x is called the 
centre of the k-star. A k-star whose edges are all oriented towards the centre 
is called a k-in-star, and a k-star whose edges are all oriented away from the 
centre 1s called a k-out-star. 

There is a close relationship between k-faces and k-in-stars (or k-out-stars): 


(b) Let x be the centre of a k-in-star, and let F be the smallest face of P con- 
taining the k-in-star. Then F is a k-face, F is the only k-face containing the 
k-in-star, and x is the bottom vertex of F. The same statement with k-in-star 
replaced by k-out-star and bottom vertex replaced by top vertex is also 
valid. 


To prove (b), we first note that F is a k-face by Theorem 12.17(a). Any other 
face containing the k-in-star must therefore have dimension > k, cf. Corollary 
5.5. To see that x is the bottom vertex of F, note that the only vertices of F 
adjacent to x are the endpoints x,,..., x, of the edges [x, x,],..., [x, x] 
belonging to the k-in-star; this follows from Theorem 12.17(b). This implies 
that x is separated from the vertices of F adjacent to x by a suitable “hori- 
zontal” hyperplane H. Theorem 11.8, applied to F, then shows that x is 
the bottom vertex of F. For k-out-stars the statement is proved in a similar 
way. 
We shall next use (b) to prove the following: 


(c) The number f (P) of j-faces of P equals the number of j-in-stars,j = 0,...,d. 


We shall prove (c) by showing that each j-face contains one and only one 
j-in-star, and each j-in-star is contained in some j-face. Let F be a j-face. Then 
each vertex of F is the centre of a unique j-star in F ; this follows from Theorems 
12.15 and 12.12. The particular j-star whose centre is the bottom vertex of F 
is clearly a j-in-star. On the other hand, the centre of any other j-in-star in F 
must also be the bottom vertex of F by (b). Hence F contains precisely one 
j-in-star. Finally, it follows immediately from (b) that each j-in-star in P is 
contained in some (in fact, a unique) j-face of P. This completes the proof 
of (c). 

Now, for i = 0,..., d, let g,P) denote the number of vertices of P whose 
in-valence equals i. The top vertex of P has in-valence 0, and it is, in fact, the 
only vertex whose in-valence equals 0, cf. Theorem 15.2. Therefore: 


(d) go(P) = 1. 
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It follows immediately from the definitions that the number of j-in-stars of 


P equals 
4 [i 
»y (Jour ). 
i=o VU 


Using (c), we then obtain: 


d 


(ec) f,(P) = d [ace for j =0,...,d. 


A:= ( 
L} /i=0,...,d:j=0,...,4 


we can rewrite (e) as a matrix identity, 


Jo(P) 


Letting 


go(P) 
=A , 
tal P) ga(P) 


Now, A is invertible. In fact, we have 


At = 9! . 
UE} /i=0,...,d;j=0,...,4 


cf. Appendix 3, (11). Therefore, the matrix identity above is equivalent to 


go(P) fo(P) 
» =A: , 
ga P) falP) 


whence 


d . 
(f) g(P) = Y(- we) ee) fori =0,...,4. 
jJ=0 
In the relations (f), the right-hand sides are certainly independent of w. 
It then follows that, although the definition of the numbers g,P) apparently 
depends on the particular choice of w, we actually have: 


(g) The numbers g(P), i = 0,..., d, are independent of w. 


It is trivial that if w is admissible for P, then —w is also admissible for P. 
When one replaces w by —w, then all orientations of the edges of P are 
reversed. In particular, vertices having in-valence i with respect to w will 
have in-valence d — i with respect to —w, cf. (a). Bearing in mind (g), it 
follows that 


(h) g(P) = gz_(P) fori = 0,..., 4. 
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Rewriting (e) as 


fp) = ¥ (jaar + y (Jar) 


1=0 J t=n+1 


m fq j 
= (jaar + yy ( Jas-AP) 
i=o VJ i=0 J 


it then follows using (h) that 


i=O i=0 


; n . m d _ ¢ 
(i) f(P)= ¥ (Jour a) ( Jace) forj =0,...,d. 


This relation shows that the numbers f,(P) can be expressed as non-negative 
linear combinations of the numbers g,(P) with i ranging only up to n. Actually, 
for; < m + 1 the coefficient of each g,(P) is >0 in at least one of the two sums 


in (i). 


B. When P is a simple d-polytope, then every facet F of P is also simple, 
cf. Theorem 12.15. Therefore, there are numbers g,F), i= 0, ..., d — 1, 
associated with F, as defined in Part A. 

In the following, let F be a facet of a simple d-polytope P. Let w be admis- 
sible for P. Then, for each vertex x of F, let the relative in-valence of x in F 
be the in-valence of x in the subgraph of &(P, w) spanned by the vertices of F; 
in other words, the relative in-valence of x is the number of edges [ x, y] of F 
oriented towards x. Now, when the vector w is admissible for P, it is also 
admissible for F. Therefore, for any vertex x of F, the in-valence of x in 
G(F, w) equals the relative in-valence of x in F. Hence: 


(j) gf F) equals the number of vertices of F whose relative in-valence is i for 
i=0,...,d—1. 


Let w be admissible for P such that each vertex of P not in F 1s below any 
vertex of F, cf. Theorem 15.3. Then the relative in-valence of a vertex x of F 
is simply the in-valence of x in G(P, w). By (4), this implies 


(k) g(F) < g(P) fori =0,...,d—1. 


Suppose that for some i, we have strict inequality in (k). Then there is at 
least one vertex x of P not in F such that the in-valence of x is i. Therefore, 
the out-valence of x is d — i, cf. (a). It then follows that x is the centre of a 
unique (d — i)-out-star. Let G be the smallest face of P containing this 
(d — i)-out-star. Using (b), it follows that G is a (d — i)-face, and x 1s the top 
vertex of G. Since x is not in F, and each vertex of F is above any vertex of P 
not in F, we see that G and F are disjoint. Now, note that G is the intersection 
of the facets containing G, cf. Theorem 10.4, and the number of such facets 
equals i since P is simple. Let these facets be F,,..., F;. Then the i + 1 facets 
F, F,,..., F; have an empty intersection since F and G are disjoint. By 
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Theorem 14.7, this implies that i + 1 > n, provided that P is a dual neigh- 
bourly polytope. In other words: 


(1) If P is a dual neighbourly polytope, then g{F) = g(P) for i = 0,...,n — 1. 


We remind the reader that in the preceding discussion, F is any facet of P. 
The following, therefore, is the converse of (1): 


(m) If P is not a dual neighbourly polytope, then there is a facet F of P such that 
g(F) < g(P) for at least one i = 0,...,n — 1. 


To prove (m), we reverse the proof of (1). If P is not a dual neighbourly 
polytope then there is ak < nsuch that certain k facets of P, say F,,..., Fy, 
have an empty intersection, whereas any k — 1 facets intersect, cf. Theorem 
14.7. Let G denote the intersection of F,,..., F,—-,. Then G is a face of P 
whose dimension equals d — (k — 1), cf. Theorem 12.14. Let w be admissible 
for P such that any vertex of P which is not in F, is below any vertex of F,,, 
cf. Theorem 15.3. Let x be the top vertex of G. Then the out-valence of x is at 
least d — (k — 1), cf. Theorems 12.12 and 12.15, and the in-valence, therefore, 
is at most k — 1, cf. (a). Denoting the in-valence of x by i, it then follows 
that i < n — 1 and g(F,) < 9,(P). 


C. Let P be a simple d-polytope with p facets, and let w be admissible 
for P. By an i-incidence, where i = 0,...,d — 1, we shall mean a pair (F, x), 
where F is a facet of P and x is a vertex of F whose relative in-valence in F 
equals i. We denote the total number of i-incidences by I;. It follows from 
(j) that 


(nhl;= YS  gf{F) fori=0,...,d—1. 


Fe Fa-1(P) 
Combining (n) and (k), we obtain: 
(0) I; < pgf{P) fori =0,...,d— 1. 
Combining (n) and (1), we obtain: 
(p) If P is a dual neighbourly polytope, then I; = pg(P) for i = 0,...,n — 1. 
And combining (n) and (m), we obtain: 


(q) If P is not a dual neighbourly polytope, then I; < pg<{P) for at least one 
i=0O,...,n— 1. 


We shall next prove: 
(r) I; = (d — Dg(P) + G+ 1)gi+ (CP) fori = 0,...,d - 1. 


To obtain this, we shall determine I; by summing over the vertices of P, 
rather than summing over the facets as we did in (n). Let x be a vertex of P. 
Then there are precisely d facets of P containing x, and by Theorem 12.12 
there are also precisely d edges of P containing x. Since each facet is simple, cf. 
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Theorem 12.15, it follows that for each facet containing x, precisely one of the 
d edges containing x is not in the facet, cf. Theorem 12.12; we shall call this 
edge the external edge of the facet. Note that, conversely, each edge containing 
x 1s the external edge of some facet containing x; this follows immediately 
from Theorem 12.17. Now, for a facet F containing the vertex x, the pair 
(F, x) 1s an i-incidence if and only if one of the following two conditions hold: 


(a) x has in-valence iin P, and the external edge of F is oriented away from x. 
(B) x has in-valence i + 1 in P, and the external edge of F is oriented towards x. 


If x has in-valence i, then there are d — i facets F such that («) holds. If x has 
in-valence i + 1, then there are i+ 1 facets F such that (/) holds. This 
proves (r). 

We next combine (0) and (r) to obtain: 


—d+i 


Pp 
gi+i1(P) < iad g(P), i=0,...,d—1. 


Since go(P) = 1, cf. (d), it follows by induction that 
—d+i-1 
(s) uP) s (° - J fori =O... 


In a similar way, combining (p) and (r), we obtain: 


(t) If P is a dual neighbourly polytope, then 
—d+i-1 
g(P) = (’ 


fori=0O,...,n. 
And, combining (q) and (1), we get: 
(u) If P is not a dual neighbourly polytope, then 
p-—d+i-1 
g(P) < ( 
for at least one i = 0,..., Nn. 
We can now complete the proof. Combining (i) and (s) we obtain 
f(P) < ®,(d, p), j=0,...,4d, 
proving the first statement of the theorem. Combining (i) and (t) we get 
f(P) = Od, p), j=0,...,d, 


when P is a dual neighbourly polytope, proving the second statement of the 
theorem. Finally, as remarked earlier, for 0 <j < m+ 1 the coefficient of 
each g,(P) is >0 in at least one of the two sums in (i). Therefore, combining 
(i) and (u) we obtain the third statement of the theorem. LC) 
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The proof above of the Upper Bound Theorem only contains one compu- 
tation, namely, the use of formula (11) from Appendix 3 leading to (f). 
However, in the proof we do not really need (f), we only need to know that 
g(P) can be expressed by the numbers f,(P) in some way. [To see this it 
suffices to know that the matrix A is invertible, and that follows immediately 
from the fact that A is an upper triangular matrix with 1’s in the diagonal. 
The explicit formula (f) is only included here because of its relevance to the 
discussion in Section 20. 

As mentioned just after the statement of Theorem 18.1, the expression 
for ®(d, p) can be simplified for m+ 1 <j <d — 2. Moreover, for j = 0 
we have a very simple expression for ® (d, p), and for the remaining values of j 
we have a reformulation which may occasionally be useful: 


Theorem 18.2. (a) The value of ®,(d, p) equals 


rr yer) 


(b) For j = 1,...,n, the value of D {d, p) equals 


PE LEI 


(c) Forj=m-+1,...,d — 2, the value of Dd, p) equals 


Pp 
d—j) 
PROOF. (a) By the definition (2) we have 


bi(d, p) = ¥ (Para Spee) 


i=0 i=0 l 


The desired expression then follows easily using Appendix 2, (9). 


(b) We can rewrite the first sum in (2) using identities from Appendix 3 as 
indicated: 


“ fi\fp—d+i-1 

S(O) 

2) oO, qifi\f4 — P\am, .,,f4—- P\ {4d -p-j-1 
= 2, »(;)( ye vr j )( n—j 


2“ + rd = Jeo +pt+jtlin—-j- ' 


J 
J J 


Hence, we have the second term in the desired expression. 
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We next prove that for 0 < j < d we have 


0n 


The validity of (3) is proved using identities from Appendix 3 as indicated: 


<j (d —i\(p-—d+i-1\@ _-j( ~F-1\fp-d+i-1 
x, ( J )( i Je xcw (| 
(2) d-i-j —j-—l _ f—-pt+d 
= A (i!) H( | 

— “yes” cama jal 

ae (4 —j) i 


as desired. 
Using (3) we can now rewrite the second sum in (2): 


x \(' ” * i — ' 
EC) -> (“Po 
Or) BC I) 


Hence, we have the two remaining terms in the desired expression. 

(c) Although we already know that the statement is true, we would like to 
give a direct proof. For j > n+ 1, each term in the first sum in (2) has the 
value 0. In the second sum, all terms corresponding to values of i that are 
> d — j also have the value 0. Therefore, 


®(d, p) = y (’ . \(' Tati ') 
i=0 J l 


Combining with (3) above, we then get 


When m = n, this completes the proof. When m = n — 1, it remains to 
consider the value j = m+ 1=n. However, this is easily handled by 
returning to the expression for ® (d, p) in case (b). The details are left to the 
reader. LI 
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By duality, we also have an Upper Bound Theorem for the simplicial 
d-polytopes. It may be stated as follows: 


Corollary 18.3. For any simplicial d-polytope P with p vertices we have 
ff{P) < ®y_ ;~ (d, p), j=1,...,d—1. 
If P is neighbourly, then 


ff{P) = Dy_ 1 - fd, p), j=il,...,d—-—1. 
If P is not neighbourly, then | 
ff{P) < Dy_,~ fd, p), j=n-—l,...,d—1, 


(and possibly also for smaller values of j). 


Finally, it is interesting to note that (f) and (h) in the proof of Theorem 
18.1 show that 


¥ 9 (!) 400 = ¥ (1 J ) ft), i=0,...,d, 
j=0 i j=0 d—i 


1.e. (fo(P), ..., fa—1(P)) satisfies the Dehn—Sommerville System of Theorem 
17.5. Hence, we have an independent proof of the Dehn—~Sommerville 
Relations which does not rely on Euler’s Relation. 


819. The Lower Bound Theorem 


In the preceding section we determined the largest number of vertices, edges, 
etc. of a simple d-polytope, d > 3, with a given number of facets. In this section 
we Shall find the smallest number of vertices, edges, etc. The result which is 
known as the Lower Bound Theorem was proved by Barnette in 1971-73. 
Like the Upper Bound Theorem, it is a main achievement in the modern 
theory of convex polytopes. 


As we saw at the beginning of Section 18, all simple 3-polytopes with a 
given number of facets have the same number of vertices and the same 
number of edges. So, as in the case of the Upper Bound Theorem, the problem 
is only of significance for d > 4. 

We define 


(d — 1)p — (d + 1) — 2), j= 09; 


—{. d — 1 — )), = 1,...,d—2. 
(ae jt ( Ds J 
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Note that 


d+1 
_»4(d = — 
Pa-2(d, p) = dp (7) 


= dp — (d? + ay/2. 


With this notation the Lower Bound Theorem may be stated as follows: 


Theorem 19.1. For any simple d-polytope P with p facets we have 


f«{P) = Od, p), j=O0,...,d — 2. 
Moreover, there are simple d-polytopes P with p facets such that 


I(P) = Of4, P), J =9,...,d — 2. 


Since (3, p) = 2p — 4 and (3, p) = 3p — 6, we see immediately as in 
the case of the Upper Bound Theorem that the theorem is true for d = 3, in 
fact, with equality for all simple polytopes. 

Before proving Theorem 19.1 we need some notation and some pre- 
paratory lemmas. 

We remind the reader that a facet system in a polytope P is a non-empty set 
S of facets of P. When Y is a facet system in P, we denote by (SY) the union 
of the subgraphs Y(F), Fe Y, of YP), and we say that ¥ is connected if 
GS) is a connected graph. These concepts were introduced in Section 15, 
where we also proved some important results about connectedness prop- 
erties of GS). 

When -¥ is a facet system in P and G1s a face of P, then we shall say that 
G isin ¥ or Gis a face of Y, if G is a face of some facet F belonging to Y. 
In particular, the vertices of Y are the vertices of the facets in Y. 

In the following, we shall restrict our attention to facet systems in simple 
polytopes. Let Y be a facet system in a simple d-polytope P, and let x be a 
vertex of Y. Then x is a vertex of at least one member F of Y. Therefore, the 
d — 1 edges of F incident to x are edges of YS. If the remaining edge of P 
incident to x is also in , we shall say that x is internal in ¥ or that x is 
an internal vertex of Y. If, on the other hand, the remaining edge of P 
incident to x is not in , we shall say that x is external in ¥ or that x is an 
external vertex of Y. In other words, a vertex x of ¥ 1s external if and only if 
it is a vertex of only one member of Y. 

The first lemma ensures the existence of external vertices under an obvious 
condition. (In the following, we actually need only the existence of just one 
external vertex.) 


Lemma 19.2. Let Y be a facet system in a simple d-polytope P such that at 
least one vertex of P is not in S. Then ¥ has at least d external vertices. 
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ProoF. If all vertices of Y are external, then each member of YY contributes 
at least d external vertices. Suppose that some vertex z of ¥ is internal. By 
the assumption we also have a vertex y not in & We then use the d-con- 
nectedness of GP), cf. Theorem 15.6, to get d independent paths joining y 
and z. Traversing the ith path from y to z, let x; be the first vertex which 1s in 
SF. Then the preceding edge is not in Y, and therefore x; is external in . Since 
the x,’s are distinct, we have the desired conclusion. LJ 


During the proof of Theorem 15.7 it was shown that if Y is a connected 
facet system in P and has at least two members, then there is a member F, 
of Y such that Y\ {Fo} is again connected. When P is simple, we have the 
following much stronger result: 


Lemma 19.3. Let S be a connected facet system in a simple d-polytope P. 
Assume that at least one vertex of P is not in S, and that FY has at least two 
members. Then there is a pair (X,, Fo) formed by an external vertex x, of S 
and the unique member Fy of Y containing Xo such that the facet system 
S\ {Fo} is again connected. 


PROOF. We know from Lemma 19.2 that Y has external vertices. Let (x,, F;) 
be a pair formed by an external vertex x, of “ and the unique member F, 
of Y containing x,. Suppose that Y\{F,} is not connected. Let /, be a 
maximal connected subsystem of Y\{F,}. We shall prove that then there is 
another pair (x,, F,) such that Y, U{F,} is a connected subsystem of 
S\{F,}. In other words: if A\{F,} is not connected, then we can replace 
(x,, F,) by some (x,, F,) in such a manner that the maximum number of 
members of a connected subsystem of A\ {F,} 1s larger than the maximum 
number of members of a connected subsystem of Y\{F,}. Continuing this 
procedure eventually leads to a pair (x), Fo) with the property that S\ {Fo} 
is connected. 

Now, let (x,, F,) and Y, be as explained above. We first prove that 
SF, U{F,} is connected. Let y be any vertex of Y,; note that y ¥ x, since F, 
is the only member of Y containing x, and F, ¢ Y,. By the connectedness of 
F there is a path in YS) joining y and x,. Traversing this path from y to x,, 
let F be a member of containing the first edge of the path not in Y,. 
(Since x, 1s not in Y,, such an edge certainly exists.) Then clearly Y, U {F} 
is connected. By the maximality property of “, we must have F = F,, 
whence Y U {F,} 1s connected, as desired. Let A) := S\(Y, U {F,}). Then 
SF’, is non-empty, possibly disconnected. By Lemma 19.2, 5, has external 
vertices. Not every external vertex of ', can be in F,. For then every path 
joining a vertex of / and a vertex of P not in.“, would have to pass through 
a vertex of F,, whence the subgraph of Y(P) spanned by ext P\ext F, would 
be disconnected, contradicting Theorem 15.5. Let x, be an external vertex of 
F', not in F,, and let F, be the unique member of “{, containing x,. Then 
actually x, is external in Y. For if not, then x, would have to be a vertex of 
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some member F of .”,, since F, is the only member of “ containing x), 
and x, is not in F,; but then Y, U {F,} would be connected, contradicting 
the maximality property of Y,. Hence, x, is external in Y, the facet F, is the 
unique member of Y containing x,,and “, U {F,} is aconnected subsystem 
of Y\ {F,}, as desired. L] 


Lemma 19.4. Let Y be a connected facet system in a simple d-polytope P. 
Assume that at least one vertex of P is not in ¥, and that ¥ has at least two 
members. Let (Xo, Fo) be as in Lemma 19.3. Then at least d — 1 vertices of P 
are internal in S but external in S\ {Fo}. 


PRooF. By the connectedness of Y, there is a member F of Y with F # F, and 
FOF, # @. Then by Theorem 12.14, the face F ~ Fy. has dimension d — 2, 
whence F and Fy have at least d — | vertices in common. Being vertices of 
two members of , such d — 1 vertices are all internal in Y. So, if they are 
all external in Y\ {Fo}, we have the desired conclusion. If they are not all 
external in /\ {Fo}, one of the vertices, say y, is internal in Y\ {Fo}. In 
particular, y # Xy. Then by Theorem 15.7 there are d — 1 independent paths 
in YF) joining xX, and y. Traversing the ith path from x, to y, let x; be the 
first vertex which is in Y\ {Fo}. Then the preceding edge [u;, x;] is not in 
SF\ {Fo}, and therefore x; is external in Y\ {Fo}. In particular, x; # xy and 
x; # y. Moreover, since [u;, x;] is not in Y\ {Fo}, it must be in F,, whence x; 
is a vertex of Fy. Since x; is also a vertex of Y\ {Fo}, we see that x; belongs to 
at least two members of , showing that x; is internal in Y. In conclusion, 
the d — 1 vertices x,,..., xy_, are internal in Y but external in. Y\{F 5}. 


Lemma 19.5. Let Y be a facet system in a simple d-polytope P such that at 
least one vertex of P is not in Y. Then there are at least d facets G,,..., Gq of 
P such that G,,...., Gg are not in F but each contains a (d — 2)-face which 
is in SF, 

Proor. Let x be a vertex of P not in Y. Let Q be a dual of P in R% and let w 
be an anti-isomorphism from (A(P), <) onto (F(Q), <). Writing 


SF = tF,,..., Fm, 


x is not a vertex of any of the F;’s, whence the facet w({x}) of O does not con- 
tain any of the vertices w(F;) of Q, cf. Theorem 9.8. Let z be a point of R? 
outside Q but “close” to w({x}) such that every vertex of Q is also a vertex 
of Q’:= conv(Q VU {z}); then the vertices of Q’ are the vertices of Q plus the 
vertex z and the edges of Q’ are the edges of Q plus the edges [z, u], where 
ueext w({x}). (Supposing that o € int P, one may take Q’ to be the polar of a 
polytope obtained by truncating the vertex x of P, cf. Section 11.) By Theorem 
15.6 there are d independent paths in Y(Q’) joining the vertices z and W(F,). 
Traversing the ith path from z to W(F,), let y; be the vertex preceding the 
first of any of the vertices W(F,), ..., W(F,,) on the path. Then by duality, 
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wo tdy.},....W *Cya}) are d facets of P not in & each having a (d — 2)- 
face in common with some member of /%. L] 


We are now in position to prove the Lower Bound Theorem: 


Proor (Theorem 19.1). We divide the proof into four parts. In Part A we 
prove the inequality for j = 0, and in Part B we prove the inequality for 
j = d — 2;here the lemmas above are used. In Part C we cover the remaining 
values of j; the proof is by induction. Finally, in Part D we exhibit polytopes 
for which we have equality. 


A. Wechoose a vertex x of P and let 
S = {FEF y_(P)|x € F}. 


Then G&(./) is the subgraph of GP) spanned by ext P\{x}, whence, by 
Theorem 15.5, Y is a connected facet system. The number of members of # 
is p — d. 

Only one vertex of P is not in , namely, the vertex x. The d vertices of P 
adjacent to x are external vertices of , and they are the only external vertices 
of Y. Hence, the number of internal vertices of F is fo(P) — (d + 1). 

If p = d + 1, then P is a d-simplex and the inequality holds with equality. 
If p > d + 2, we remove facets from Y one by one by successive applications 
of Lemma 19.3. At each removal, at least d — 1 vertices change their status 
from internal to external by Lemma 19.4. After p — d — 1 removals, we end 
up with a one-membered facet system. The total number of vertices which 
during the removal process have changed their status is therefore at least 


(p —d — 1d — 1). 
Since the number of internal vertices equals fo(P) — (d + 1), it follows that 
fo(P) - (d+ 1) 2 (p—d— Id — 1), 
whence 
fo(P) = (d — Ip — (d+ fd — 2), 


as desired. 


B. This part is divided into two steps. We first prove that if there is a 
constant K depending on d only such that 


fa—2(P) = dfg_,(P) — K (1) 
for all simple d-polytopes P, then the desired inequality 
fa-2(P) = dfg—,(P) — (d* + d)/2 (2) 


must hold. Then, in the second step, we show that (1) holds with K = d? + d. 
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Suppose that the inequality (2) does not hold in general. Then there is a 
simple d-polytope P in R@ such that 


fa-2(P) = dfy-,(P) — @? + d)/2 — 4 
for some r > 0. Let Q be a dual of P in R“. Then Q is a simplicial d-polytope 
with 
fi(Q) = dfo(Q) — (d* + d)/2 —r. 


By Theorem 11.10 we may assume that there is a facet F of Q such that the 
orthogonal projection of R4 onto the hyperplane aff F maps Q\F into ri F. 
Let Q’ denote the polytope obtained by reflecting Q in aff F. Then Q, := 
OU Q' is again a d-polytope by the property of F. It is clear that Q, 1s 
simplicial. Since F has d vertices, we have 


fo(Q1) = 2fo(Q) — d, 


and since F has 


d 2 
(;) = (d° — d)/2 


edges, we have 
f,(Q1) = 2f(Q) — (@* — a)/2. 
We then get 
fi(Q1) = 2(dfo(Q) — (d* + d)/2 — r) — d* — a)/2 
= df,(Q,) — (a? + d)/2 — 2r. 
Let P, be a dual of Q,. Then P, is a simple d-polytope with 
fa—2(P 1) = dfy—,(P,) — (d? + d)/2 — 2r 


This shows that P, fails to satisfy (2) by at least 2r faces of dimension d — 2. 
Continuing this construction we conclude that no inequality of the form (1) 
can hold for all simple d-polytopes. This completes the first step. 

To carry out the second step, let P be any simple d-polytope, and let 
p:= fy-,(P). Let x and ¥ be asin Part A. If p = d + 1, then P is a d-simplex, 


whence 
d+1 
fa-2(P) = ( ¥ ) 


= d(d + 1) — (d? + ay/2 
= dp — (d?2 + d)/2 
> dp — (d2 +d), 


as desired. For p > d + 2, we shall remove the facets in Y one by one by 
successive applications of Lemma 19.3 as we did in Part A. Let F; denote the 
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ith member of Y to be removed, let x; denote a corresponding external vertex 
of S\{F,,..., F;_,} contained in F;, and let 


Ser=a{F,OF |F,AF; #4 O,j=itl,..., ph, i=l,...,.p—d—-l. 


Then S; is a facet system in F,, cf. Theorem 12.14. 

Now, let us say that a (d — 2)-face G of F; is of type 1 in F; 1f G is not in 
SF; but some (d — 3)-face of G is in Y;. Lemma 19.5 can be applied to the 
facet system S; in F;, for x; is a vertex of F; not in Y;. Asa result we get d — 1 
(d — 2)-faces of F; of type 1. Note that a (d — 2)-face of type 1 in F; is not a 
face of any F, with; > i. 

Fori=1,...,p—d-— 1, let 


qi= maxi jli<j, FiO Fj # O}. 


Then G; := F; 0 F,,18 a(d — 2)-face of F; which we shall call a (d — 2)-face of 
type 2 in F;. Note that F; and F,, are the only facets of P containing G;, 
cf. Theorem 12.14, that G, 1s not at the same time of type | 1n F;, and that G; 
is neither of type | nor type 2 in F,,.. 

The discussion above now shows that for i= 1, ..., p—d-— 1, the 
number of (d — 2)-faces contributed by F; is at least d, namely, d — 1 of 
type 1 and one of type 2. Therefore, the total number of (d — 2)-faces of P 
is at least 


(p — d — 1)d = dp — (d? + d), 


as desired. 


C. Using induction on d we shall prove that the inequality holds for the 
remaining values of j, namely, j = 1,...,d — 3. We first note that for d = 3 
there are no such remaining values; this ensures the start of the induction. 
So, let d > 4 and assume that the inequality holds for dimension d — 1 and 
j=1,...,(d — 1) — 3. Let P be a simple d-polytope with p facets, and let j 
have any of the values 1,..., d — 3. By a j-incidence we shall mean a pair 
(F, G) where F is a facet of P and Gis a j-face of F. (This notion of incidence 
differs from the one used in the proof of the Upper Bound Theorem.) It is 
clear that the number of j-incidences equals 

y SF). 
FeFa-1(P) 
Moreover, since each j-face of P is contained in precisely d — j facets, the 
number of j-incidences also equals (d — j) f,(P). Hence, 


(d-jf(P)= )d SF). (3) 


Fe Fa-i(P) 


We next note that for any facet F of P we have 


d—1 d 
IAF) = ( . 1) fan) - (, ‘ Jc -2-)) (4) 
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in fact, for j = 1, ..., d — 4 this follows from the induction hypothesis 
applied to F, and for j = d — 3 it follows from the result of Part B applied to 
F. Combining (3) and (4) we obtain 


d— | 
@-)fP> > (ies) (S aman 


Fe Faq-(P) 
-(75 1) yf ©)-(,% \c 2-j) Yi 
jt rect uwy  ” 1 M pe H oP 


_ (d-1 d | 
= (" 1), 2 fe 2(F ~(, t1)@ = 2D 


Here 


Y fy-2(F) = 2f,-2(P) 


FeFa_-1(P) 


since each (d — 2)-face of P is contained in precisely two facets. Hence, 
@d—pfPy>(“~ tly -(.% \a-2-2 
DS j Via d-2 j+l JP: 


We next apply the result of Part B to P, obtaining 


| d—1 d+1\ d | 
(d — j)f(P) = (" (én - (i ))- (,: d= 2 =e 


An easy calculation shows that the right-hand side of this inequality may be 


rewritten as 
d d+ 1 
(d -o(( n Je (" ea — 1 -o) 


Cancelling the factor d — j, we obtain the desired inequality. 


D. It is easy to see that we have equality for all j when P is a d-simplex. 
Truncation of one vertex of a simple d-polytope P with p facets produces a 
simple d-polytope P’ with p + 1 facets, with 


(5) 


more j-faces than P for 1 <j < d — 2, and withd — 1 more vertices than P, 
cf. Theorem 12.18. It is easy to see that if we have equality for P, then we also 
have equality for P’. Hence, the desired polytopes may be obtained from a 
d-simplex by repeated truncation of vertices. This completes the proof of 
Theorem 19.1. LJ 


It would be desirable to have a more direct proof of the Lower Bound 
Inequalities than the one given in Parts A, Band C above. As a beginning, one 
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could think of a direct proof of the inequality for j = d — 2, replacing the 
two-step proof of Part B. In the second step we proved that (1) holds with 
K = d* +d. Compared to the desired inequality, the deficit amounts to 
(d? + d)/2. However, when counting the (d — 2)-faces we did not count 
those containing x; the number of such (d — 2)-faces equals 


d 2 
({)-¢ — d)/2. 


This improvement does not yield the desired inequality, but it reduces the 
deficit to d. 

In Part D of the proof of Theorem 19.1, we showed that we have equality 
for the truncation polytopes, i.e. the polytopes obtained from simplices by 
successive truncations of vertices. Ford > 4itis known that if f{P) = ~,(d, p) 
for just one value of j, then P must be a truncation polytope. For d = 3 the 
situation is different. As we know, all simple 3-polytopes yield equality. 
On the other hand, there are simple 3-polytopes which are not truncation 
polytopes, for example, the parallellotopes. 

In Section 18 it was indicated that the upper bound ®,(d, p) is also valid 
for non-simple polytopes. In contrast to this, little seems to be known about 
lower bounds for non-simple polytopes. 

In its dual form, the Lower Bound Theorem may be stated as follows: 


Corollary 19.6. For any simplicial d-polytope P with p vertices we have 
IP) = Qa-1-AdsP» fe lyeed 1. 
Moreover, there are simplicial d-polytopes P with p vertices such that 


S(P) = Ga-1-fd,p), j= l,...,d—1. 


Equality in Corollary 19.7 is attained by the duals of the truncation 
polytopes, and, ford > 4, only by these. They are the polytopes obtained from 
simplices by successive addition of pyramids over facets; they are called 
stacked polytopes. 

It is interesting to note that the Lower Bound Inequalities are closely 
related to inequalities between the numbers g,(P) introduced in Section 18. 
For details, see Section 20. 


§20. McMullen’s Conditions 


At the beginning of Section 16 it was indicated that it is not known how to 
characterize the f-vectors of d-polytopes among all d-tuples of positive 
integers. However, the more restricted problem of characterizing the f- 
vectors of simple (or simplicial) d-polytopes has recently been solved. It was 
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conjectured by McMullen in 1971 that a certain set of three conditions would 
characterize the f-vectors of simplicial d-polytopes. In 1980, the sufficiency 
of McMullen’s conditions was established by Billera and Lee, and the 
necessity was established by Stanley. We shall briefly report on these funda- 
mental results, but we shall not be able to include the proofs. As in the 
preceding sections, we shall express ourselves in terms of simple (rather than 
simplicial) polytopes. 


We begin by introducing some notation. For any d-tuple f = (fo,...,fa—1) 
of positive integers we define 


d . 
g()= Y(- v() fy = O..0d, 


where, by convention, we always put f,; = 1. Note that when fis the f-vector 
of a simple d-polytope P, then g,( f) is just the well-known number g,(P), cf. 
statement (f) in the proof of Theorem 18.1. Note also that by the argument 
leading to this statement, 


i=o \J 


We need some more notation. Let h and k be positive integers. Then using 
induction on k, it is easy to see that there exist uniquely determined positive 
integers rp, r,,..., 7, Such that 


ro > ry > > rye k-qe1 


(tle rrh 9 


In fact, ro is the largest integer such that 


ro 
h> ("?}. (3) 


r, is the largest integer such that 


(aCe) 


(unless we have equality in (3) in which case q = 0), etc. The representation 
(2) is called the k-canonical representation of h. 

Given the k-canonical representation (2) of h, we define the kth pseudo- 
power h‘*® of h by 


{ky ro + 1 r, +1 -_ r,t+1 
" (er) + k JT OTN @g 4a) o) 


and 
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Note that at the same time, (4) is the (k + 1)-canonical representation of 
h‘ It is easy to see that the kth pseudopower is monotone: 


h, <h, eh? < hs”. 
The definition of h‘”’ is extended to h = 0 by letting 
0 = 0, 


We can now formulate the characterization; recall that m:=|(d — 1)/2| 
and n := |d/2|. The conditions (a)-—(c) are McMullen’s Conditions. 


Theorem 20.1. A d-tuple f = (fo,..., fa—1) of positive integers is the f-vector 
of a simple d-polytope if and only if the following three conditions hold: 


(a) g(f) = ga-Af) fori = 0,...,m. 
(b) g(J) S gi+i(f) fori = 0,...,n — 1. 
(c) gi+i(f) — 91) S$ GAP) — gi-s PY? fori = 1,...,0 — 1. 

We know from the preceding sections that the Dehn-Sommerville 
Relations, the Upper Bound Inequalities and the Lower Bound Inequalities 
hold for f-vectors of simple polytopes. Therefore, it follows from Theorem 20.1 
that if (a)-(c) hold for some f = (fo,..., fg-1), then the Dehn-Sommerville 
Relations, the Upper Bound Inequalities and the Lower Bound Inequalities 
must also hold for f. We shall see how this can be demonstrated. (This may 
give the reader some idea of the significance of the conditions (a)-(c).) 

We first note that by the definition of g,(f), condition (a) is equivalent to 
saying that (fo,...,/,—,) satisfies the Dehn—Sommerville System of Theorem 
17.5 which in turn is equivalent to saying that (fo, ..., f,-.,) satisfies the 
Dehn-Sommerville Relations. 

To deduce the Upper Bound and Lower Bound Inequalities we need the 
observations that 


gf) = 1 


and 
ga-1(f) = fa-1 — 4. 
By (a), we then also have 


golf) = 1 


and 
gi(f) = fa-1 — 4. 
We begin with the Lower Bound Inequalities. By (a) and (b), 


g(f)<sg{f) t=1,...,d-1. (5) 
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Using (1), (5) and the observations above, we have 


= golf) + Yo) + 940) 


> golf) + d—- Dgi(f) + gdPS) 
=~1+(d—-1f..,-a +1 
= (d — I)fa-, — (d+ 1d — 2), 


as desired. For j = 1,..., d — 2 we have in a similar manner 
d . 

X (| a (f) 

d-1 j d 

| (Jats )+ (‘oar ) 


> y (Jan + (‘joa 


“es ()+() 
Now, by Appendix 3, (9) 


y V 30) 


jJ+t(d-1-j+l d 
d—1l1-j -(," 4} 


heir A(,%,)+(0) 


d+ 1 
=(,6 ,)m- (OF aot ea. 


as desired. Hence, the Lower Bound Inequalities hold. 
To deduce the Upper Bound Inequalities, we first prove that 


. a-1 ~€@+i-I| 
gif) — gi-s(f)’ < (’ on ) i=l1,...,.n—1. (6) 


For i = 1, we have 


oS 
fl 


whence 


gi(f) — gf) = fa-1 — da — 1. 
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The 1-canonical representation of f;_, — d — 1, of course, is given by 


fa-1 —d—- ' 


fis a -1=( 1 


Hence, 


ee. 
ain - ary? = ("5% 


proving (6) for i = 1. To prove (6) in general, we use induction. Suppose 
that (6) holds for i. Using (c) and the monotonicity of the pseudopower we 
then have 


(gi+i(f) - gd fy"? S (git) — gi-(f)?)'* 1> 


__ - <i+t1> 
<(" d+i ' | 
i+ 1 


To find the (i + 1)th pseudopower of 


fi-y-dt+i-1 
i+ 1 


we need the (i + 1)-canonical representation. This, of course, is given by (!) 


fy-1-d+ti-1 = (fant 
i+1 7 i+1 


fay -d+i-1\P (fiy—dti 
i+ 1 7 i+2 


proving (6) fori + 1. 
Using (6) we shall next prove that 
fa-1 ~— d + 1 —_ 1 


U 


Hence 


wits ( ) i=0,...,n. (7) 


This is certainly true for i = 0, 1, in fact with equality. We prove it in general 
by induction. Supposing that it holds for i, we have by (c) and (6), 


gis) < 94S) + GS) -— 91-1 KX? 


c(t Ny (eet 
l i+ 1 


_ fa-, ~dt+i 
7 i+1 ’ 


as desired. 
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To complete the proof, note that using (1) and (a) we have 


n 


mod j 
fi = 3 (Jaan + 2X ( j Yat 


for j = 0,...,d. Combining with (7), we get 


isd (Nene ay (Sone) 


for j = 0,..., d, which is the desired inequality. 


APPENDIX 1 
Lattices 


A relation < on a non-empty set M is called a partial order if it is reflexive, 
anti-symmetric and transitive, 1.e. if 


x XxX, 
< 


XRyVyAVRX>PX=y), 


and 
XRVAVRIZ>SXR2Z 


for all x, y,z€ M.A partially ordered set is a pair (M, =), where M is a non- 
empty set and ~ is a partial order on M. 

In the following, let (M, <) be a partially ordered set, and let N be a 
subset of M. An element x € M is called a lower bound of N if x < y for all 
yeéN. Similarly, x is called an upper bound if y < x for all ye N. An element 
x € M is called the greatest lower bound of N if x is a lower bound of N and 
z <x for any other lower bound z. The greatest lower bound of N is unique 
if it exists; it is denoted by inf N. Similarly, x is called the least upper bound 
of N if x is an upper bound of N and x = z for any other upper bound z. The 
least upper bound is unique if it exists; it is denoted by sup N. 

A partially ordered set (M, <) is called a lattice if inf N and sup N exist 
for each non-empty finite subset N of M. If inf N and sup WN exist for any 
subset N of M, then the lattice (M, =) is called a complete lattice. 

Any finite lattice is complete. 

If (M, <) is a partially ordered set such that inf N exists for all subsets N 
of M, then, in fact, (M, <) is a complete lattice. The same applies to sup N. 

Let (M, X) be a lattice, and let M’ be a non-empty subset of M. Then the 
partial order < on M induces a partial order on M’ which we shall again 
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denote by <. We shall say that the partially ordered set (M’, <) is a sub- 
lattice of the lattice (M, <) ifinf N € M’ and sup N € M’ for each non-empty 
finite subset N of M’. (Here, of course, inf N and sup N mean the greatest 
lower bound and least upper bound, respectively, of N in (M, <).) Then 
(M", <) 1s also a lattice. Note that in the definition of a sublattice, we require 
more than just (M’, X) being a lattice. 

A mapping ¢ from one lattice (M,, <) onto another lattice (M,, <) is 
called an isomorphism when it is one-to-one and we have x < y if and only if 
P(x) X gy) for all x, ye M,. If there exists an isomorphism from (M,, <) 
onto (M,, X), then we shall say that (M,, <) and (M,, <) are isomorphic. 
A mapping w from (M,, <) onto (M,, <) is called an anti-isomorphism when 
it is one-to-one and we have x < y if and only if W(y) < W(x) for all x, ye My. 
If there exists an anti-isomorphism from (M,, <) onto (Mj, <), then we 
shall say that (M,, <) and (M,, XS) are anti-isomorphic. 

Note that an isomorphism 9 preserves inf and sup, i.e. p(inf N) = inf @(N) 
and g(sup N) = sup g(N), whereas an anti-isomorphism w reverses inf and 
sup, Le., W(inf N) = sup W(N) and Y(sup N) = inf WN). 


APPENDIX 2 


Graphs 


The intuitive picture of a (non-oriented) graph is that of a finite set of “ver- 
tices” and a finite set of “edges,” each edge “joining” two distinct vertices and 
each two distinct vertices being joined by at most one edge. Formally, this 
may be expressed as follows: a (non-oriented) graph is a triple T = (V, E, y), 
where V (called the set of vertices of T) is a non-empty finite set, E (called 
the set of edges of I) is a set (necessarily finite), and y (called the incidence 
relation of T) is a one-to-one mapping from E onto a subset of the set of all 
sets {x, y} of two distinct elements of V. 

When x and y are distinct vertices of a graph T = (V, E, y) and e is an 
edge of I such that y(e) = {x, y}, then we shall say that e joins x and y, that x 
and y are the endvertices of e, that x and y are incident to e, and that e is 
incident to x and y. When x and y are distinct vertices of joined by an edge, 
then we shall say that x and y are adjacent. The number of edges incident to a 
vertex x, i.e. the number of vertices adjacent to x, is called the valence of x. 

A path in a graph IT is a finite sequence of the form 


X15 C1. Xa5 C25-- +5 Cn— 19 Xns 


where the x,’s are vertices of I’, and the e,’s are edges of I’ such that each e; 
joins x; and x;,,. The path is said to join x, and x,, and x, and x, are called 
the endvertices of the path. For technical reasons we allown = 1,1.e. we allow 
trivial paths consisting of one vertex and no edges. A collection of paths 
joining two vertices x and y is called independent if x and y are the only 
vertices common to any two of the paths in the collection. 

Two non-adjacent vertices x and y of a graph are said to be separated by a 
set W of vertices if every path joining x and y must contain a vertex of W. 

A graph is said to be connected if any two distinct vertices can be joined 
by a path. A disconnected graph is one which is not connected. A graph is said 
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to be k-connected (where k is a positive integer) if it has at least k + 1 vertices 
and any two distinct vertices can be joined by at least k independent paths. 
(Except for the trivial graphs with just one vertex, 1-connectedness is the 
Same as connectedness.) 

By a subgraph of a graph T = (V, E, y) we mean a graph I” = (V’, E,, y’) 
such that V’c V, E’ c E and y‘(e) = ye) for ee E’. In general, each non- 
empty subset V’ of the vertex set V of T is the vertex set of several subgraphs 
of I’: each subset E’ of the edge set E of I with the property that it only 
contains edges of I’ joining vertices in V’ is the edge set of a subgraph with 
V’ as the vertex set. If E’ contains all the edges of I’ joining vertices in V’, we 
call the resulting subgraph the subgraph spanned by V’. 

Two non-adjacent vertices x and y of a graph I = (V, E, y) are separated 
by a set W of vertices (in the sense described above) if and only if the subgraph 
of I spanned by V\ W is disconnected. 

A path in a graph I may be considered as a subgraph. In general, it is not 
spanned by its set of vertices. 

Let I; = (V;, E;, 7), i= 1,...,n, be subgraphs of a graph I = (V, E, y). 
Let 


n 


Ve= UY, E= UE,  y(e):=yle), eck. 
i=1 
Then I” = (V’, E’, y’) is a subgraph of I; we shall call it the union of the sub- 
graphs I’; and denote it by, UT, U---UT,,. 
In the main text we shall use the following two important connectedness 
results: 


~ 
II 
pee 


Theorem A2.1. A graph T = (V, E, y) with at least k + 1 vertices is k-con- 
nected if and only if for each k — 1 vertices x,,...,X,—, of T, the subgraph TY’ 
of I’ spanned by 


V’ = V\{x1, sey X,-1 


is connected. 


Theorem A2.2. Let I’, and I, be k-connected subgraphs of a graph YT. If T; 
and 1, have at least k vertices in common, then their union T, UT, is also 
k-connected. 


The proofs of these two theorems will be given below. Theorem A2.2 is an 
easy consequence of Theorem A2.1. The main difficulty in proving Theorem 
A2.1 is taken care of by the following lemma: 


Lemma A2.3. Let x and y be non-adjacent vertices of a graphT. If the number of 
vertices of 1 needed to separate x and y equals k, then there are k independent 
paths in T joining x and y. 
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ProoF. It is trivial that the statement is true when k = 1. Suppose that it is 
not true for all k. Let ky be the smallest value of k for which the statement is 
not true. Let I) be a graph with the minimum number of vertices such that 
for appropriate non-adjacent vertices x, and y, of Ty, the number of vertices 
needed to separate x, and yy equals ky, whereas the maximum number of 
independent paths joining x, and yy is at most kg — 1. By removing “super- 
fluous” edges from I, if necessary, we may, in addition, assume that any 
graph I) obtained from I’) by removing one edge has the property that only 
ky — 1 vertices are needed to separate x, and yo in To. 
We first prove: 


(a) No vertex of Ty is adjacent to both Xo and yo. 


Suppose that a vertex v is adjacent to both xo and y,. Let I'(v) denote the 
subgraph of Ig spanned by all vertices of Ig except v. Then clearly ky — 1 
vertices are needed to separate x, and y, in I(v). By the minimality property 
of ky we then see that there are k,y — 1 independent paths in I'(v) joining x, 
and y,. Along with the path whose vertices are x9, v, Vo, this makes a total of 
ky independent paths in I’) joining x, and yg, a contradiction. 

We next prove: 


(b) Let W be any set of ky vertices of 9 separating Xp) and yo. Then either 
every vertex in W is adjacent to X9, or every vertex in W is adjacent to yo. 


If for some v € W, every path joining x, and y, passing through v contained 
at least one more vertex from W, then already W\ {v} would separate x, and 
yo, acontradiction. Therefore, for each v € W there is at least one path joining 
Xg and yy such that v is the only vertex from W on that path. In particular, for 
each v € W there 1s a path joining x, and v such that v is the only vertex from 
W on the path. The union of all such paths is a subgraph of I.,. Adding to this 
subgraph the vertex yo plus ky “new” edges, each joining y, to a vertex in 
W, we obtain a new graph which we shall denote by I'(x,). Changing the 
roles of X9 and yg, we obtain in a similar manner another new graph I(yo). 
Note that both I(x 9) and I'(y,) have the property that ky vertices are needed 
to separate xX, and y,; for any separating set in I(x.) or I'(yg) must also be a 
separating set in I). Supposing that neither xg nor yo is adjacent to all 
vertices in W, it follows that both I(x.) and I'(y,) have less vertices than 
I). By the minimality property of I) we then see that in both I'(x,) and 
I'(yo) there are kg independent paths joining x, and y,. Removing from the 
paths in I'(x9) the vertex yo and the (new) edge incident to yo, we get k, paths 
in Ig, each joining x, to a vertex in W such that no vertex except x, belongs 
to more than one of the paths. In a similar way, removing from the paths in 
I'(yo) the vertex x, and the (new) edge incident to x9, we get ky paths inT,, 
each joining yy to a vertex in W such that no vertex except y, belongs to more 
than one of the paths. Now, these 2k, paths go together in pairs, each pair 
having some vertex from W in common. Each such pair determines a path 
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joining x9 and yo. The resulting ky paths joining x, and y, are in fact in- 
dependent. To see this, first note that each vertex in W belongs to exactly one 
of the paths. However, it is also impossible for a vertex not in W to belong to 
two of the paths. Suppose that z was such a common vertex of two paths p, 
and p,. Then by the independence of the ky paths in I'(x,) and the inde- 
pendence of the ky paths in (yo), z had to lie between x, and av, € W onone 
of the paths, say p,, and between yo and a v, € W on the other path p,. But 
then we could construct a path joining x, and yo via z not entering W, which 
is a contradiction. 
To complete the proof of the lemma, let 


XQ Cos Uys C45 U2, ++ +5 Uns Cns Vo (1) 


be a path in Ig joining x, and y. By the non-adjacency of x, and yj, and (a), 
we must haven > 2. Let Ig denote the subgraph of I’, obtained by removing 
from I’, the edge e,. By one of our initial assumptions, only k, — 1 vertices 
are needed to separate x, and yo in Ig. Let W’ be such a separating set of 
vertices in I. Then clearly both 


= W vo tuys 
and 
27= W'Y {uz} 


Separate Xo and yg in Ig. It follows from (a) that u, is not adjacent to yp and 
U, 1S not adjacent to x,. Application of (b) to W{, then shows that each vertex 
in W' is adjacent to xg, and application of (b) to W’, shows that each vertex 
in W’ is adjacent to yg. Since the number of vertices in W’ is ky — 1 whichis at 
least 1, we get a contradiction by appealing to (a). This completes the proof 
of the lemma. LJ 


PRrooF (Theorem A2.1). Suppose first that I is k-connected. Let x,,..., x,_; 
be any k — 1 vertices of I’, and let x and y be any two vertices from 


V' = V\ {x1, sey X,—1}- 


By assumption, x and y can be joined by k independent paths in . None of 
the vertices x,,..., X,-, belongs to more than one of these paths by the 
independence. Hence, at least one of the paths does not pass through any x,. 
This shows that there is a path in the subgraph I’ spanned by V’ which joins 
x and y. In conclusion, I’ is connected. 

To prove the converse, let x and y be any two distinct vertices of I. If x 
and y are non-adjacent, it follows from the assumption that at least k vertices 
are needed to separate x and y. Lemma A2.3 next shows that there are at least 
k independent paths joining x and y, as desired. If x and y are adjacent, we 
argue as follows. Remove from I the edge joining x and y, and call the re- 
sulting graph I”. In I’, the vertices x and y are non-adjacent. Suppose that 
certain k — 2 vertices x,,...,X,—, would separate x and yin I”. In I” there 
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is at least one additional vertex z. Since x and y are separated, z must also 
be separated from at least one of the vertices, say y. But then the k — 1 
vertices X,,..., X,;—2, X separate y and z, which contradicts the assumption. 
Hence, in I at least k — 1 vertices are needed to separate x and y. Lemma 
A2.3 then tells that there are k — 1 independent paths in I joining x and y. 
Together with the path x, e, y, where e denotes the edge joining x and y 
in I’, this makes k independent paths in T joining x and y. CJ 


PROOF (Theorem A2.2). We use Theorem A2.1. Let x,,...,x,-, beanyk — 1 
vertices of [, UIT, 1e. x,,..., X,-, belong to V, U V;, where V, and V, 
denote the vertex set of [, and I’,, respectively. Let 


Vi := (V, WU V3)\{X1, wey X,—1}5 


and let I’ denote the subgraph of IT, UIT’, spanned by V’. Since at least k 
vertices are common to I’, and I,, at least one common vertex, say XQ, is 
distinct from all the x,’s,i = 1,...,k — 1. Application of the “only if” part of 
Theorem A2.1 to I, shows that the subgraph I"; of I, spanned by 


1t= Vi\tX,- 6-5 Xe af 


is connected. In the same manner, the subgraph I, of ’, spanned by 


2 = Vy\{X 4, 0-65 Xe 


is connected. Since I, and I’, have x, as a common vertex, it follows that 
I, UI, 1s connected. Since I, UI, = I’, the desired conclusion follows 
from the “if” part of Theorem A2.1. LJ 


Finally, we shall say a few words about oriented graphs. The intuitive 
picture of an oriented graph is that of a (non-oriented) graph as described 
above where, in addition, each edge is equipped with an “orientation”. 
Formally, this may be stated as follows: an oriented graph is a triple T = 
(V, E, y), where V (called the set of vertices of T) is a non-empty finite set, E 
(called the set of edges of I) is a set (necessarily finite), and y (called the 
incidence relation of I) is a one-to-one mapping from E onto a subset of the 
set of all ordered pairs (x, y) of two distinct elements of V; moreover, we 
require that if x and y are distinct vertices with y(e) = (x, y) for some edge 
e, then y(e’) # (y, x) for all edges e’. 

Fach oriented graph [ = (V, E£, y) has an underlying non-oriented graph 
I" = (V, E, y’), whereby y’(e) = {x, y} when y(e) = (x, y) or y(e) = Vy, x). 
Therefore, everything that we have said above about non-oriented graphs 
also applies to oriented graphs, in the sense that it applies to the underlying 
non-oriented graph. 

When I = (V, E, y) is an oriented graph, and y(e) = (x, y), then we say 
that the edge e is oriented towards y and away from x. The number of edges 
oriented towards x is called the in-valence of x, and the number of edges 
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oriented away from x is called the out-valence of x. Hence, the sum of the 
in-valence of x and the out-valence of x equals the valence of x. 

Each non-oriented graph may be turned into an oriented graph by 
choosing for each edge of the graph one of the two possible orientations of 
that edge. Formally, this means that if [ = (V, E, y) is a non-oriented graph, 
then we get an oriented graph I” = (V, E, y’) by choosing y’ such that y’(e) = 
(x,y) or y'(e) = (y, x) whenever y(e) = {x, y}. Of course, y’ is not unique 
(unless I’ has no edges at all). 


APPENDIX 3 
Combinatorial Identities 


In the main text we shall need certain identities involving binomial coefficients. 
The purpose of the present section is to give a unified exposition of these 
identities. 


In the following, a, b and c always denote integers. Moreover, we always 
assume b > O, whereas a and c may be negative. 
Recall that 


(5) <= 1)---(a—b+1) >I 


b b! 


and 
In particular, 


and 


144 App. 3. Combinatorial Identities 


If a > 0, then 


a 
b 
equals the number of choices of b elements among a elements. 
We shall leave it to the reader to verify the following: 


( : ) 7 a (;) (4) 
)=(.53)+ (Ch (5) 
() () - () (: i ) (6) 


(In (1), (3) and (6) it is understood that a > b.) 


The combinatorial identities that we need in the main text are all con- 
sequences of the following basic identity, known as the Vandermonde 


Convolution: 
> fa Cc at+c 
— 7 
Sle ® ( b ” 


For a,c > 0, this is easy to prove. In fact, 


le ) 


is the number of choices of b elements among a + c elements such that k 
elements are chosen among certain a elements and the remaining b — k 
elements are chosen among the remaining c elements; summing over k then 
yields (7). However, we need (7) for arbitrary integers a and c. We prove it by 
induction on b. For b = 0, it is trivial. Suppose that it holds for b. Then, 
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using also (4), we have 


E(ills+1—x) 


es k ,b+ilak a C 
= 2 b+1 b+1 kKJ\b+1—k 


k=0 
> k fa b> b+1-—k fa C 
b+1 (ds «1 J+ 2, bri \kJ\b+1—k 
po h+i C b> b+1—k C 
fhe Woe. caen)td, ber b+ (Neai a) 
b a—1 a\{c—1 
Saat Yes) 2aevk— 4) 
a [{(a-—1)+c n C aren” 
—b+1 b b+1 
ate at+tc-—l 
b+ 1 b 
_ fate 
lbs) 


Hence, (7) holds for b + 1, as desired. 
Taking c = —1 in (7) and using the fact that by definition 


(,— =(-pr™, 
we get 

nf 2\ a-—1 
nl) 

b—a 

34) : 

atk ,{-~a—1 
Cec’) 


cf. (2), we see that (8) is equivalent to 


y ("% + ‘) - _ oy ’ 


a+b+1 
- ( h } (9) 


Since 
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We shall next prove: 


b L[k\ (c ,(¢\fc-a-1 
Yen (Ne) =< (:)( _ } 0<ach. 


The proof of this uses (6) and (8). We have 


2-r()G) = Zn) 
Er) 


| 
Th et 
—_, 
peek 
we” 
= 
ee 
QQ 
“ee” 
ae 
> a a 
| | 
Q & 
“ee” 


completing the proof of (10). 
A particular case of (10) is the following: 


Yc (i)({) = pre@o O<a<b, 0<cK<b. 


In fact, if a = c, then 


»(c\fe-—a-— 1 _ —| 
al es aa 


= (-1)(-1)"4 
=(-1)°; 


()- 


and ifa <c, then0 <c —a—1 < b — a, whence 


ifa > c, then 


This completes the proof of (11). 
Our final identity is the following: 


Leo, . ; = —r(; - :) O0<a<b. 


(10) 


(11) 


(12) 
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b fk C 

Em Nlo<s) 

_4{7a-1 C 

LED ( k—a N(, — ; 
aw {-a-l C 

(—1) > ( h iw) — a) 


_(. (4? + ‘ 


we eee oN) 


Using (3), (7) and (2), we have 


2 Mallo 1) = 


— (__1)4f _1)\b-4 
=(-1)(-1) ( »_, 


b—c 
-uft=9) 


completing the proof of (12). 

The combinatorial identities of this section may be interpreted as state- 
ments about products of matrices. As an important example, let us consider 
the identity (11). Let B and D denote the (b + 1) x (6 + 1) matrices 


B:=((- v() | 
LU} Ji=O,..., bs j=0,...,b 

D:= (/ | 

lL} J i=0,...,b;j=0,...,b 


Then the identity (11) tells that B and D are mutually inverse matrices. 
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and D. W. Walkup [19]; here the necessity of condition (b) of Theorem 20.1 
is conjectured and it shown that (a) and (b) imply the Lower Bound Theorem. 

The sufficiency of McMullen’s conditions was established by L. J. Billera 
and C. W. Lee [4, 5]. In their proof (which is formulated in the setting of 
simplicial polytopes) they produce a simplicial d-polytope with a given 
f =(o;---s fa-1) as its f-vector by taking the vertex-figure at z ofa(d + 1)- 
polytope of the form conv(Q vu {z}), where Q is a suitably chosen cyclic 
(d + 1)-polytope and z is a suitably chosen point outside Q. 

The necessity of McMullen’s conditions was established by R. P. Stanley 
[22]. Stanley’s proof (which is formulated in the setting of simplicial poly- 
topes) uses advanced algebraic geometry; it would be very desirable to have a 
more elementary proof. 

A conjecture on the characterization of f-vectors of simple unbounded 
polyhedral sets has been formulated by L. J. Billera and C. W. Lee [6]. 

It has been conjectured that the f-vectors of simplicial (or simple) d- 
polytopes P are unimodal, Le. 


fo(P) <= fi(P) S++ S ACP) => fea i(P) 2 -°+ = fa-1(P) 


for some k. (The necessity of (a) and (b) in Theorem 20.1 shows that 
(9o(P), .--, gg(P)) is unimodal for every simple d-polytope P.) According to 
A. Bjorner [8], it can be shown that for any simplicial d-polytope P 


fo(P) < fi(P) < +++ < flajai—i1€P) < fiay2i(P). 
fisa—1y4i(P) > +--+ > fa-2(P) > fa-1(P). 


These inequalities immediately imply unimodality for d < 8. It is even 
possible to show unimodality for all d < 15 by checking each d separately, 
cf. [8]. But unimodality does not hold in general: one knows 20-dimensional 
simplicial polytopes P (with on the order of 10/° vertices) such that 
fii€P) > fir2(P) < f13(P), cf. [5], [8]. 

To conclude the comments on Chapters 1-3, let us mention, without going 
into detail, that many combinatorial results about convex polytopes admit 
extensions to more general geometric objects. 


150 Bibliographical Comments 


Our exposition of Menger’s Theorem (Theorem A2.1 of Appendix 2) is 
based on the book of B. Bollobas [9] which we also recommend as a general 
reference to graph theory. As a general reference to combinatorial identities 
(Appendix 3) we recommend the book of J. Riordan [20]. 
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